The 100 Biggest Public Companies in the World in 2022
/r/Infographics
https://redd.it/zx9sfc
Spreadsheet of all FIFA 23 fut player data (OC)
https://docs.google.com/spreadsheets/d/1zdnAtkEwM8p21PsPP1KJg7vJuXWzvcWDYT8mxEt3Omw/edit?usp=sharing
/r/datasets
https://redd.it/zwp1q1
How do you pluralize English words? (All)
Google forms link
/r/SampleSize
https://redd.it/zwt5x4
[OC] November 2022 for five Boeing 737-800s
/r/dataisbeautiful
https://redd.it/zwo89h
Datasets - Cheminformatics related data
Can somebody give pointers as to where I can find antibody/protein/peptide datasets which contain experimental information related to,
* plasma half-life,
* thermal stability,
* solubility,
* Aggregation propensity,
* Immunogencity.
I need it for data mining and analytics.
/r/datasets
https://redd.it/zu017x
Top 50 Big Data Analytics Tools and Software You should know in 2023
https://bigdataanalyticsnews.com/top-big-data-analytics-tools/
/r/datasets
https://redd.it/zvj7pz
[OC] User m-rage posted their flower blooming records here, but that was deleted for lacking computer-generated content. I entered it into a spreadsheet, added graphs plus some date analysis. I used OpenOffice Calc. [13778x3084]
https://redd.it/zw044u
@datascientology
Advice for finding a detailed dataset on stocks listed in the S&P 500?
I need to find a dataset for a college course. I'm interested in finance and would like to do some exploratory analysis on S&P 500 stocks. I've already found something that is almost perfect for what I want to do: S&P 500 Companies with Financial Information | Kaggle
However, I would prefer a dataset of the exact same format but that has even more columns. I would especially like to have more categorical columns, since the only truly categorical one from this one is the sector. I know that there are a lot more columns that could make sense for such a dataset, such as historical return over the past N years, or other financials. So I really do feel like there must be some dataset out there that looks exactly like this but has even more columns.
I'm just having trouble with finding such a dataset. So far I've just been googling something like "dataset of s&p 500 stocks with financial information" or "dataset of s&p 500 stocks with many columns," but I haven't really found anything better yet by doing so.
I would appreciate any suggestions.
/r/datasets
https://redd.it/zw6lyy
Steam's average player does not obey the rules of scale
/r/dataisugly
https://redd.it/zvvj72
America's car reliance: getting to work across 48 states mapped
/r/MapPorn
https://redd.it/zvyc67
[OC] - My 2022 Spending Breakdown - 25M - (Sankeymatic)
/r/dataisbeautiful
https://redd.it/zvpuvt
Introducing BastionLab - Collaborate with our simple privacy framework for data visualization!
📈 We’re thrilled to introduce BastionLab, our open-source and simple privacy framework for data science collaboration!
To see what plotting looks like when privacy issues are automatically handled for you, you can check our GitHub or directly go to our Visualization tutorial 📊
### Built for sensitive data collaboration
Collaboration between data owners and data scientists is a big challenge for highly regulated fields like health, finance, or advertising due to security and privacy issues. When collaborating remotely, data owners have to open their whole dataset, often through a Jupyter notebook. This too-broad access creates huge privacy gaps because too many operations are allowed, which enables data scientists to extract information from the remote infrastructure (print the whole database, save the dataset in the weights, etc).
⚙️ BastionLab solves this problem by providing fine-grained access control. It guarantees data owners that data scientists can only perform privacy-friendly operations on their data and that only anonymized outputs are shared with them.
### How does BastionLab work?
BastionLab makes sure that the data owner’s remote data is never accessed directly by the data scientist. Three main elements ensure this:
- First, a ‘safe zone’ is defined by the data owner to filter the data scientist’s queries, which enforces control while allowing for interactivity.
- Second, expressivity is limited. This means that the type of operations that can be executed by the data scientists is restricted to avoid arbitrary code execution.
- Finally, the data scientist never accesses the dataset locally. They only manipulate a local object that contains metadata to interact with the remotely hosted dataset - and data owners can always see the calls made by that object.
### Ready to try?
If you like the project, drop a ⭐ on our GitHub! We’re open-source, so it’s a big help ^^
/r/visualization
https://redd.it/zspjml
E Is there a major difference between possible job opportunities of MS Stats and MS Biostats holders?
I basically have three options: go into work force with my B.A, get an MS in stats, or get an MS in biostats. My degree is in Econ & Math and I'm leaning towards wanting to work in finance on the analytics/DS side or maybe as a quant dev (completely different path).
The reason I 'm leaning towards getting a masters is because I've read and seen salary data suggesting MS stats holders make significantly more than their bachelor holding counterparts, plus many of the more technical positions require an MS.
I don't really want to work in the biostats fields, its more of a backup if I don't get into the normal stats masters, as the foundations are the same. So I'm just wondering if a masters in biostats is a deterrent for those wanting to go into finance/tech? Thanks.
/r/statistics
https://redd.it/zv20jh
[OC] How to remember the name (now with sound)
/r/dataisbeautiful
https://redd.it/zx96hp
Should r/SampleSize accept images in posts? Results
, - ~ ~ ~ - ,
, ' \ ' ,
, \ 33.3% ,
, \ Nope ,
, \ ,
, _,
, ,
, 66.7% ,
, Sure ,
, , '
' - , , '
N = 96
/r/SampleSize
https://redd.it/zw7mrc
Announcements: Image posts have returned! (Those who share results on r/SampleSize)
Users of r/SampleSize from before the last wave of moderators rejoice! After a couple people requested the return of image posts, including most recently u/ToLoveThemAll, we've hashed out in the background how it'll work, and now image posts are allowed when using the **Results** flair! They will work as follows.
* You have the option to make an Image post to have Reddit host an image to our subreddit.
* Results is the only flair that images will be allowed, any other flair posting an image will be removed.
* Results-flaired-image-posts will still be filtered, and will be pushed forward on an approval basis. We will receive a modmail every time a user attempts to post using the Results flair, so we can manually approve images and threads.
/r/SampleSize
https://redd.it/zw80ic
[OC] Breakdown of how dating went for me in 2022 as a 22M
/r/dataisbeautiful
https://redd.it/zw4e8l
The Endangered Alphabets Project is looking for volunteers
https://mobile.twitter.com/TBAlphabets/status/1607436824988852230
/r/datasets
https://redd.it/zvxy8n
[OC] 30 Most &. least prosperous countries in the world according to Legatum Prosperity Index.
/r/dataisbeautiful
https://redd.it/zwehys
Yearly Deaths by Natural Disaster, going backwards from 2021 to 1900 [OC]
/r/dataisbeautiful
https://redd.it/zw5fjs
Data science & analytics style guide and best practices.
Hi! I've been workin on DA for a bit more than a year now and I love it but I haven't found good documentation about it.
I see a lot of resources with a strong focus on tools (softwares or languages e.g.) but laking rigurosity on definitions and best practices.
I would like to find some book or documentation regarding the following topics:
Definitions: Dimentions, facts, types of analytics, wrangling, cleaning etc.
Best practices: recomended procedures for ETL, querys etc.
Thanks a Lot!
/r/datascience
https://redd.it/zw04md
[OC] North American cities by number of major sports championships (Updated December 2022)
/r/dataisbeautiful
https://redd.it/zvw7pf
[OC] Every High School Baseball Field Used in the State of West Virginia
/r/dataisbeautiful
https://redd.it/zvtef4
[OC] US city housing price changes since 1991
/r/dataisbeautiful
https://redd.it/zvt54h
Global Terrorism in the 21st Century: A Map of Attacks and Incidents
/r/MapPorn
https://redd.it/zvmz1a