Hot data science related posts every hour. Chat: https://telegram.me/r_channels Contacts: @lgyanf
The Biggest Source of Power in Every State and Province [OC]
/r/dataisbeautiful
https://redd.it/z5bjhk
Countries with recorded temperature extremes above 48°C and under -48°C
/r/MapPorn
https://redd.it/z5i5kz
Most important skills to cultivate
I’m finishing a physics/astronomy program in about a year and have a few elective spots open. I’ve heard data science is a good route for math/physics people. What kind of skills are most important to get your foot in the door and which classes would help most with those? Thanks!
/r/datascience
https://redd.it/z4spvt
[OC] The Slow Decline of Key Changes in Popular Music
/r/dataisbeautiful
https://redd.it/z5dty1
LGBT+ Rights in the Middle East.
/r/MapPorn
https://redd.it/z4zh7m
Original Data > Processing > Output [OC]
https://redd.it/z51gpw
@datascientology
My iPod touch decided to display 58% of battery charge as roughly 75% on the lock screen
/r/dataisugly
https://redd.it/z4gex7
Got promoted to manage a small team (less than 4)
So I got promoted and will now manage a small team. We do a mix of BI and basic data science. Any tips on how to organize the work for a small team, from your experience? Or any other tips, for that matter?
Thanks
/r/datascience
https://redd.it/z4q7sg
[OC] Top 10 largest oil fields by 2021 production
/r/dataisbeautiful
https://redd.it/z4ipyx
[OC] - Google searches for "food bank"
/r/dataisbeautiful
https://redd.it/z4ut5j
[OC] The Largest Entertainment Streaming Companies
/r/visualization
https://redd.it/z4le9z
The Dangers of Correlation implying Causation (examples)
https://redd.it/z4vajo
[OC] The Largest Entertainment Streaming Companies
/r/dataisbeautiful
https://redd.it/z4le59
[R] Robust Learning: the past and present. The DNN has strong fitting capability, but we find ...
ProSelfLC: Progressive Self Label Correction Towards A Low-Temperature Entropy State
arXiv: https://arxiv.org/abs/2207.00118
Code: https://github.com/XinshaoAmosWang/DeepCriticalLearning
https://preview.redd.it/0vtst8xkk22a1.png?width=1181&format=png&auto=webp&s=a07a443bb633e7efacd4c41d7941ec6591e46d25
https://preview.redd.it/jtc5n5jlk22a1.png?width=1195&format=png&auto=webp&s=2643109cd59c2b130bbd3f8d8aaeb7054a68ec3e
https://preview.redd.it/3c8c9i7mk22a1.png?width=1239&format=png&auto=webp&s=c670ab84fe973ae34326b8cd86652c25ebaadc50
/r/MachineLearning
https://redd.it/z49k7x
[Q] What is the logic behind a low p-value being favorable?
I need help understanding why a low p-value is favorable. Wouldn't you want a high p-value to indicate that the experiment results are consistent with the same experiment being run on other sample groups? For example, if I found there was a 4" height difference between men and women in my sample group, ran the t-test and got a p-value of 0.95, wouldn't that mean that 19/20 times this experiment is run on different sample groups there will be a 4" height difference? So I could say more confidently that there is, in fact, a 4" height difference?
/r/statistics
https://redd.it/z3yohh
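On the p-value question above: a p-value is the probability, assuming there is no real difference between the groups, of seeing a gap at least as large as the one observed. The sketch below illustrates that with a permutation test, swapped in for the t-test mentioned in the post because it needs only the standard library. The heights, sample sizes and the `permutation_p_value` helper are hypothetical, chosen purely for illustration.

```python
import random


def permutation_p_value(a, b, n_perm=2000, rng=None):
    """Two-sided permutation p-value for a difference in means (illustrative helper)."""
    rng = rng or random.Random(0)
    mean = lambda xs: sum(xs) / len(xs)
    observed = abs(mean(a) - mean(b))
    pooled = list(a) + list(b)
    hits = 0
    for _ in range(n_perm):
        # Under the null hypothesis the group labels are arbitrary, so shuffle them.
        rng.shuffle(pooled)
        diff = abs(mean(pooled[:len(a)]) - mean(pooled[len(a):]))
        if diff >= observed:
            hits += 1
    # Add-one correction keeps the estimate strictly above zero.
    return (hits + 1) / (n_perm + 1)


rng = random.Random(42)

# Hypothetical heights in inches: a real 4" gap between the group means.
men = [rng.gauss(69.0, 3.0) for _ in range(50)]
women = [rng.gauss(65.0, 3.0) for _ in range(50)]

# Two samples drawn from the same distribution: no real gap.
same_a = [rng.gauss(67.0, 3.0) for _ in range(50)]
same_b = [rng.gauss(67.0, 3.0) for _ in range(50)]

p_diff = permutation_p_value(men, women, rng=rng)
p_null = permutation_p_value(same_a, same_b, rng=rng)

print("p-value with a real 4-inch gap:", p_diff)
print("p-value with no real gap:", p_null)
```

A small p-value says the observed gap would be rare if the groups were really the same. A p-value of 0.95 would say the data look almost exactly like what "no difference" predicts, not that the 4" gap would reappear 19 times out of 20.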
[C] End of year Salary Sharing thread
This is the official thread for sharing your current salaries (or recent offers) for the end of 2022.
Please only post salaries/offers if you're including hard numbers, but feel free to use a throwaway account if you're concerned about anonymity. You can also generalize some of your answers (e.g. "Large CRO" or "Pharma"), or add fields if you feel something is particularly relevant.
1. Title (e.g. statistical programmer, biostatistician, statistical analyst, data scientist):
2. Country/Location:
3. $Remote:
4. Salary:
5. Company/Industry:
6. Education:
7. Total years of Experience:
8. $Internship
9. $Coop
10. Relocation/Signing Bonus:
11. Stock and/or recurring bonuses:
12. Total comp:
Note that while the primary purpose of these threads is obviously to share compensation info, discussion is also encouraged.
/r/statistics
https://redd.it/z5cpvm
[OC] Percentage of quiz takers who knew each country in a "Name the Countries of the World" quiz.
/r/dataisbeautiful
https://redd.it/z5dul3
[OC] - US Yield Curve, mean yield curve spread, and percent of all yield curve combinations that are inverted
/r/dataisbeautiful
https://redd.it/z5a7x0
[D] Paper Explained - CICERO: An AI agent that negotiates, persuades, and cooperates with people (Video)
https://youtu.be/ciNMc0Czmfc
A team from Meta AI has developed Cicero, an agent that can play the game Diplomacy, in which players have to communicate via chat messages to coordinate and plan into the future.
OUTLINE:
0:00 - Introduction
9:50 - AI in cooperation games
13:50 - Cicero agent overview
25:00 - A controllable dialogue model
36:50 - Dialogue-conditional strategic planning
49:00 - Message filtering
53:45 - Cicero's play against humans
55:15 - More examples & discussion
Homepage: https://ai.facebook.com/research/cicero/
Code: https://github.com/facebookresearch/diplomacy_cicero
Blog: https://ai.facebook.com/blog/cicero-ai-negotiates-persuades-and-cooperates-with-people/
Paper: https://www.science.org/doi/10.1126/science.ade9097
Abstract:
Despite much progress in training AI systems to imitate human language, building agents that use language to communicate intentionally with humans in interactive environments remains a major challenge. We introduce Cicero, the first AI agent to achieve human-level performance in Diplomacy, a strategy game involving both cooperation and competition that emphasizes natural language negotiation and tactical coordination between seven players. Cicero integrates a language model with planning and reinforcement learning algorithms by inferring players' beliefs and intentions from its conversations and generating dialogue in pursuit of its plans. Across 40 games of an anonymous online Diplomacy league, Cicero achieved more than double the average score of the human players and ranked in the top 10% of participants who played more than one game.
Authors: Anton Bakhtin, Noam Brown, Emily Dinan, Gabriele Farina, Colin Flaherty, Daniel Fried, Andrew Goff, Jonathan Gray, Hengyuan Hu, Athul Paul Jacob, Mojtaba Komeili, Karthik Konath, Minae Kwon, Adam Lerer, Mike Lewis, Alexander H. Miller, Sasha Mitts, Adithya Renduchintala, Stephen Roller, Dirk Rowe, Weiyan Shi, Joe Spisak, Alexander Wei, David Wu, Hugh Zhang, Markus Zijlstra
/r/MachineLearning
https://redd.it/z4s2kp
Sunrise and sunset times throughout the year, arranged in a circle [OC]
/r/dataisbeautiful
https://redd.it/z55n18
[P] I trained a dog to fetch a stick using Deep Reinforcement Learning
/r/MachineLearning
https://redd.it/z52bsl
The 2022 brain
/r/funnycharts
https://redd.it/yw768h
[P] OpenELM, a library combining evolutionary algorithms and language models
Hi all,
This is a new library combining large language models with evolutionary algorithms for code synthesis, by CarperAI.
Github: https://github.com/CarperAI/OpenELM
Huggingface model: https://huggingface.co/CarperAI/diff-codegen-350m
Blog post: https://carper.ai/openelm-release/
ELM stands for Evolution Through Large Models, a technique from a recent OpenAI paper demonstrating that large language models can act as intelligent mutation operators in an evolutionary algorithm, enabling diverse and high quality generation of code in domains not seen in the language model’s training set.
The library contains an implementation of MAP-Elites with a language model as the mutation operator, and the Sodaracer 2D environment as a testbed where you can evolve robots with a language model.
In addition, there is also an open-source diff model fine-tuned on GitHub diffs from Salesforce's CodeGen 350M code synthesis model, under an MIT license. This diff model will let you more easily generate intelligent code suggestions in ELM.
/r/MachineLearning
https://redd.it/z4pjnt
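For readers unfamiliar with MAP-Elites, which the OpenELM post above mentions, here is a minimal sketch of the general idea: keep one best solution ("elite") per behaviour cell, and repeatedly mutate a randomly chosen elite. This is not OpenELM's API or implementation. The toy 2D problem, every function name, and the Gaussian-noise mutation (standing in for a language-model mutation operator) are assumptions made for illustration.

```python
import random


def map_elites(fitness, descriptor, mutate, random_solution,
               n_bins=10, n_init=20, n_iters=2000, seed=0):
    """Return an archive mapping each behaviour cell to its best (fitness, solution)."""
    rng = random.Random(seed)
    archive = {}

    def try_insert(solution):
        cell = descriptor(solution, n_bins)
        score = fitness(solution)
        # Keep the newcomer only if the cell is empty or it beats the current elite.
        if cell not in archive or score > archive[cell][0]:
            archive[cell] = (score, solution)

    # Seed the archive with random solutions.
    for _ in range(n_init):
        try_insert(random_solution(rng))

    # Main loop: pick a random elite, mutate it, and try to place the child.
    for _ in range(n_iters):
        _, parent = rng.choice(list(archive.values()))
        try_insert(mutate(parent, rng))
    return archive


# Hypothetical toy problem: solutions are points (x, y) in the unit square.
def toy_random_solution(rng):
    return (rng.random(), rng.random())


def toy_fitness(sol):
    # Higher is better; the best possible score is 0, reached at y = 0.5.
    return -(sol[1] - 0.5) ** 2


def toy_descriptor(sol, n_bins):
    # The behaviour cell is the bin that x falls into.
    return min(int(sol[0] * n_bins), n_bins - 1)


def toy_mutate(sol, rng):
    # Gaussian noise stands in for the language-model mutation operator.
    clip = lambda v: min(1.0, max(0.0, v))
    return tuple(clip(v + rng.gauss(0.0, 0.1)) for v in sol)


archive = map_elites(toy_fitness, toy_descriptor, toy_mutate, toy_random_solution)
for cell in sorted(archive):
    score, sol = archive[cell]
    print(cell, round(score, 4), tuple(round(v, 3) for v in sol))
```

The archive ends up holding a spread of solutions across the behaviour space, not a single optimum, which is the "diverse and high quality" property the post refers to.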
[E] Teaching HS students how to conduct survey & write research paper
I’m currently teaching a HS Intro to Stats course. I’d like to give my students a research study to complete during the 2nd semester but they really need a clean, easy to follow framework bc they’ve never done anything like this at all. Curious if other teachers have done this / if anyone has ideas of good resources to provide a guideline? I don’t want it to be too simple but it also can’t be too complex. To be honest, my students aren’t the most motivated in the world so they will need a lot of structure for this. Thx for any thoughts!
/r/statistics
https://redd.it/z4tcoy
Do you guys find D3 useful?
I took 1/2 of a course on how to use D3, and have been regretting abandoning it ever since.
It strikes me as one of those tools that appears to have unlimited creative potential. I'm wondering if it lives up to this in practice.
In your experience, how useful do you find D3? Is it "too flexible" & low-level? Or do you often find nice & creative applications for it that make your stakeholders happy? How does it compare to ggplot2 (my current free-form visualization package of choice)?
Moreover how often is it necessary to build visualizations "from scratch", rather than using standard pre-packaged options?
/r/datascience
https://redd.it/z4jvy2
Trendeigh: Popularity of "eigh" in US names 1910-2020 [OC]
/r/dataisbeautiful
https://redd.it/z4el7h
Putting the population of Manhattan and the Dakotas side by side
/r/MapPorn
https://redd.it/z4iwmq
Rivers of South America [OC]
/r/dataisbeautiful
https://redd.it/z4l1pj
[D] First time NeurIPS
I am going to NeurIPS next week. This is the first time I am going to an AI conference, and the first time I am going to a very large conference. I did my PhD in pure math, so I have been to plenty of academic conferences, but they were all smaller (less than 100 people) events. I am presenting a workshop paper and am going alone from Europe.
Anyone have any general tips when going to a large AI conference for the first time?
It would be nice to find some people to have lunch or dinner with, because in my experience you learn at least as much by talking to people as you do from academic presentations. So I am curious about how the social interactions at these conferences work: do people hang out mostly with their own crowds, or is it easy to get in touch with new people?
I am also vaguely looking for interesting people and places where I might go on a research stay (paid by my job) some time in the future, so that is another motivation for meeting people.
/r/MachineLearning
https://redd.it/z48t6e