The most expensive buildings ever constructed
/r/Infographics
https://redd.it/104499i
[OC] The number of Representatives in the US Congress per million Capita
/r/dataisbeautiful
https://redd.it/104fd75
Percentage of arable land of a countrys whole area
/r/MapPorn
https://redd.it/104bs2b
[OC] Country Distribution of Top 500 Companies by Market Capitalization
/r/dataisbeautiful
https://redd.it/10421kr
Resources on best practices for professional projects (e.g. version control, virtual environments, debugging, documenting code)
Hi everybody,
I've recently finished my Master's and am currently applying for data scientist/analyst positions. During my studies, I have acquired a decent knowledge of different algorithms, data cleaning, visualization, evaluation, etc. However, I feel like I didn't learn a lot of the fundamentals around any projects. That is, how to properly document code, use virtual environments and version control, etc. Basically, anything that will make a project seem professional and easily usable by others.
Are you aware of any resources on such best practices? For example, a course dedicated to a bunch of them or individual resources to help me to get a better understanding of how to create a professional project.
I would appreciate any pointers! :)
/r/datascience
https://redd.it/103y1yh
Q Which statistical methods became obsolete in the last 10-20-30 years?
In your opinion, which statistical methods are not as popular as they used to be? Which methods are less and less used in the applied research papers published in the scientific journals? Which methods/topics that are still part of a typical academic statistical courses are of little value nowadays but are still taught due to inertia and refusal of lecturers to go outside the comfort zone?
/r/statistics
https://redd.it/103utzy
"I'm gonna make him a Neural Network he can't refuse" - Godfather of AI
/r/datascience
https://redd.it/103zgua
Repost Would you want to know early if you were going to develop a neurodegenerative disease? (UK Residents, 18+)
If you have a spare 10 minutes please consider completing our role play study from Newcastle University! You will be taken through a hypothetical scenario of receiving an eye test, offered a new tool to screen for neurodegenerative diseases, and asked some questions. https://nclpsych.eu.qualtrics.com/jfe/form/SV\_bar3BNC588x3Yfc
/r/SampleSize
https://redd.it/103tpwp
John Snow's 1854 cholera map of London that changed epidemiology forever; showing cases concentrated around the Broad Street water pump
https://www.theguardian.com/news/datablog/interactive/2013/mar/15/cholera-map-john-snow-recreated
/r/dataisbeautiful
https://redd.it/103r1pz
GNC Brand protein powder... only about 5% better than the placebo.
/r/dataisugly
https://redd.it/103gnf4
2022 Dec winter event power outage time-lapse [OC]
/r/dataisbeautiful
https://redd.it/103dlvj
What’s your favourite swear word? (Everyone 13+)
Anyone can take this survey, as all of the demographic questions are optional and most include an “other” option to be inclusive of options that I may have not thought of.
https://docs.google.com/forms/d/e/1FAIpQLSfp8kTuUfFBQ1funASmvQ-jsJh8qpb4erOluQgGWBNYh2sgw/viewform?usp=sflink
/r/SampleSize
https://redd.it/1046vud
University of Georgia historian Claudio Saunt created an interactive map showing the decline of Indian/Native homelands from 1776 to 1887. Along with Slate's Rebecca Onion, he turned that map into a GIF, showing just how rapidly European-Americans took what amounted to over 1.5 billion acres:
/r/MapPorn
https://redd.it/1048eor
Introducing Jupyter Scheduler
https://blog.jupyter.org/introducing-jupyter-scheduler-f9e82676c388
/r/IPython
https://redd.it/zm3wbo
[OC] Republicans are more likely to be happy and satisfied compared to Democrats
/r/dataisbeautiful
https://redd.it/103yggi
Here’s a playlist of 7 hours of music with NO VOCALS I use to focus when I’m coding /learning . Post yours as well if you also have one!
Spotify | Apple | Youtube | Amazon
/r/bigdata
https://redd.it/zelzp5
U.S. Cities Where the Most People Take a Motorcycle to Work
/r/Infographics
https://redd.it/1042gs0
Mass Shootings in the USA during 2022 [OC]
/r/dataisbeautiful
https://redd.it/103u0yu
MapPorn Discussion Thread for January, 2023
This thread is for general MapPorn discussion. Exchange ideas, ask for maps, talk about cartography, etc. Have a thought that doesn't fit in another thread, post it here.
/r/MapPorn
https://redd.it/100jwix
Temperature blankets. 365 columns displaying temperature ranges each day of the year. Bottom one is Hayward, Ca 2019. Top one is Tracy, Ca 2022. Made by M.I.L! Enjoy!
https://redd.it/103kr1b
@datascientology
[OC] The Most Popular Video Game Console In Each State
/r/visualization
https://redd.it/101spth
N Legal NLP Dataset With Over 39,000 Examples Released
Legal datasets are extremely expensive because lawyers are, and this has bottlenecked legal NLP.
To address this, we release the Merger Agreement Understand Dataset (MAUD), with over 39,000 multiple-choice reading comprehension examples for 152 merger agreements that have been manually labeled by legal experts. The dataset was created with the help of the American Bar Association; without their help the dataset would have cost over $5,000,000 to create.
MAUD has substantial room for improvement and can could serve as a research challenge for NLP researchers without any legal background.
Dataset and Baselines: https://github.com/TheAtticusProject/maud/
Paper: https://arxiv.org/abs/2301.00876
/r/MachineLearning
https://redd.it/103b1ck