4,828 questions
Advice
0
votes
1
replies
54
views
Adjusting Titles to reflect a change in numerical data
Ok so there's no easy way to title this to get the information I really want to get at. But I feel like the longer winded explanation is fairly simple.
So I'm in a programming for data science class (...
Advice
0
votes
0
replies
36
views
RTOS connection with analytics
My data in firebase is getting updated by nodered and is being shown in firebase realtime database.Its has temperature,pressure,humidity.I want to analyze in firebase itself using Google analytics.Can ...
Advice
1
vote
0
replies
34
views
r2 from wblplot (built-in function)
Using the built-in function wblplot (https://uk.mathworks.com/help/stats/wblplot.html#d126e1264612) to analyse some data.
How do I quantify the goodness of fit (e.g. r^2) of the plot it produces? The ...
Advice
0
votes
3
replies
90
views
How to remember all SQL codes effectively?
I'm currently taking the Google Data Analytics certificate, and the SQL code is a bit overwhelming. How can I remember them so whenever there is a need for help, I can do the job. Any advice or tips ...
Advice
0
votes
4
replies
82
views
How does one make a player diagram/visualization like the one below within Python?
I'm quite the baseball fan, and if you are as well, you've probably seen the X profile of the user Pitcher Profiler creates these awesome post-game reports for players, and one of the visualizations ...
Best practices
0
votes
1
replies
60
views
When should data go to Archive vs Reject in Bronze layer (Medallion Architecture)?
Can anybody help with understanding the Archive and Reject folders in bronze layer at Medallion Architecture. Let say i have 4 folders in Bronze namely Raw, Stage, Archive and Reject. At what extent a ...
Advice
1
vote
3
replies
74
views
How to handle heteroskedasticity when detecting anomalies using Z-scores on growth rates?
I’m trying to detect anomalies in a dataset using Z-scores based on the logged index change of a value between two time periods: VAL_t0 and VAL_t1.
The issue:
The variance of Z-scores decreases as ...
Advice
0
votes
1
replies
48
views
Analyze a directory in a performant (cross-platform) way for what file types (file extensions) it (recursively) contains?
Aim
My aim is to analyze a (big) (sub)directory and just find out what file extensions all files have there (recursively).
Additionally, these conditions apply:
I am on Windows, but I could use WSL ...
2
votes
1
answer
112
views
Multiple variable/correlation analysis using Python
I have a machine with up to 58 indipendent input variables and one response variable. If I create 2D-histogram plots of all combinations of 2 input variables and the response variable, I get plots ...
0
votes
0
answers
47
views
Unable to fetch Accurate Performance Max (PMAX) YouTube Video Metrics via Google Ads Script / BigQuery Transfer
I’m currently working on a task to fetch and display daily Google ADS Manager (GAM) records—such as Cost, ROAS, and other metrics—within a data analysis application. I’ve successfully retrieved data ...
-3
votes
2
answers
160
views
How to find the correlation between the two most commonly sold items
How to find the correlation between the two most commonly sold items?
I have data of items sold in the supermarket. I want to summarise:
what are the most commonly sold items together?
what is the ...
2
votes
1
answer
156
views
FFT-based quasi-steady detection issue
I am trying to detect the beginning of a quasi-steady regime in time series data representing drag (Fx) and lift (Fy) forces after an initial transient.
Initially, I used a slope-based method, but it ...
1
vote
1
answer
52
views
How to prefix a specific series of lines (multiple)?
I have a text file, which has an inconsistent timestamp format, that I would like to standardize. This is in a transcript from an interview; ultimately for textual analysis.
What command could I run ...
0
votes
0
answers
46
views
Creating a Line graph & Matrix showing % Difference from Previous Year, allowing for a filter on "Country"
I have raw data with columns "Year", "Period", "Country", "Sales Amount". Please note there are 13 periods in a year, (and not 12). The dates also differ ...
0
votes
0
answers
168
views
Why my Transformer model did not work well when dealing with single cell multi-omic data
The complete codes and data are available at:Google Disk
I'm working on a high-dimensional regression problem and have built a Transformer-based model in PyTorch. While the model trains, I'm observing ...