Machine learning and other gibberish
See also: https://sharing.leima.is
Archives: https://datumorphism.leima.is/amneumarkt/
See also: https://sharing.leima.is
Archives: https://datumorphism.leima.is/amneumarkt/
#dev
https://substack.com/redirect/4af1baf3-a4d7-48d1-9592-5ea0971a056a?j=eyJ1IjoiNHowa3gifQ.QnwKDJ1CRSD1ToSPhPzIWMi45g-Rid7OgDj8cqSear0
Sounds good but please DO NOT USE upper cases. It is not only about PEP8 but more about consistency and cognitive load.
We solve this problem by writing down the dimensions in the docstrings and also include the math expressions there. But it is already obvious that writing down the dimensions in the var names makes things much easier.
https://substack.com/redirect/4af1baf3-a4d7-48d1-9592-5ea0971a056a?j=eyJ1IjoiNHowa3gifQ.QnwKDJ1CRSD1ToSPhPzIWMi45g-Rid7OgDj8cqSear0
Sounds good but please DO NOT USE upper cases. It is not only about PEP8 but more about consistency and cognitive load.
We solve this problem by writing down the dimensions in the docstrings and also include the math expressions there. But it is already obvious that writing down the dimensions in the var names makes things much easier.
#misc
Animation of some periodic 3-body problems. How pretty.
https://en.wikipedia.org/wiki/Three-body_problem#/media/File:5_4_800_36_downscaled.gif
Animation of some periodic 3-body problems. How pretty.
https://en.wikipedia.org/wiki/Three-body_problem#/media/File:5_4_800_36_downscaled.gif
#ai
Google released new models. Those models are easier to use than llama 2.
https://www.kaggle.com/models/google/gemma
https://blog.google/technology/developers/gemma-open-models/
Google released new models. Those models are easier to use than llama 2.
https://www.kaggle.com/models/google/gemma
https://blog.google/technology/developers/gemma-open-models/
#ml
Like a dictionary
Kunc, Vladim’ir, and Jivr’i Kl’ema. 2024. “Three Decades of Activations: A Comprehensive Survey of 400 Activation Functions for Neural Networks.” arXiv [Cs.LG], February. http://arxiv.org/abs/2402.09092.
Like a dictionary
Kunc, Vladim’ir, and Jivr’i Kl’ema. 2024. “Three Decades of Activations: A Comprehensive Survey of 400 Activation Functions for Neural Networks.” arXiv [Cs.LG], February. http://arxiv.org/abs/2402.09092.
Walking in the dark is different for women and men.
Chaney, Robert A., Alyssa Baer, and L. Ida Tovar. 2023. “Gender-Based Heat Map Images of Campus Walking Settings: A Reflection of Lived Experience.” Violence and Gender, December. https://doi.org/10.1089/vio.2023.0027.
#ai
Gemini Ultra vs GPT-4: Google Still Lacks the Secret Sauce | Beebom
https://beebom.com/gemini-ultra-vs-gpt-4/
Gemini Ultra vs GPT-4: Google Still Lacks the Secret Sauce | Beebom
https://beebom.com/gemini-ultra-vs-gpt-4/
#ml
I got interested in satellite data last year and played with it a bit. It's fantastic. The spatiotemporal nature of it brings up a lot of interesting questions.
Then I saw this paper today:
Rolf, Esther, Konstantin Klemmer, Caleb Robinson, and Hannah Kerner. 2024. “Mission Critical -- Satellite Data Is a Distinct Modality in Machine Learning.” arXiv [Cs.LG], February. http://arxiv.org/abs/2402.01444.
I got interested in satellite data last year and played with it a bit. It's fantastic. The spatiotemporal nature of it brings up a lot of interesting questions.
Then I saw this paper today:
Rolf, Esther, Konstantin Klemmer, Caleb Robinson, and Hannah Kerner. 2024. “Mission Critical -- Satellite Data Is a Distinct Modality in Machine Learning.” arXiv [Cs.LG], February. http://arxiv.org/abs/2402.01444.
#ml
Jelassi S, Brandfonbrener D, Kakade SM, Malach E. Repeat after me: Transformers are better than state space models at copying. arXiv [cs.LG]. 2024. Available: http://arxiv.org/abs/2402.01032
Not surprising at all when you have direct access to a long context. But hey, look at this title.
Jelassi S, Brandfonbrener D, Kakade SM, Malach E. Repeat after me: Transformers are better than state space models at copying. arXiv [cs.LG]. 2024. Available: http://arxiv.org/abs/2402.01032
Not surprising at all when you have direct access to a long context. But hey, look at this title.
#ai
Allen AI's new model based on its high quality text including academic publication corpus.
OLMo Suite - a allenai Collection
https://huggingface.co/collections/allenai/olmo-suite-65aeaae8fe5b6b2122b46778
Allen AI's new model based on its high quality text including academic publication corpus.
OLMo Suite - a allenai Collection
https://huggingface.co/collections/allenai/olmo-suite-65aeaae8fe5b6b2122b46778
#misc
Germany, France and Poland announce the ‘Weimar triangle’ for artificial intelligence | Science|Business
https://sciencebusiness.net/news/ai/germany-france-and-poland-announce-weimar-triangle-artificial-intelligence
Germany, France and Poland announce the ‘Weimar triangle’ for artificial intelligence | Science|Business
https://sciencebusiness.net/news/ai/germany-france-and-poland-announce-weimar-triangle-artificial-intelligence
#ai
It seems that Huggingface has a lot of bargaining power.
https://www.googlecloudpresscorner.com/2024-01-25-Google-Cloud-and-Hugging-Face-Announce-Strategic-Partnership-to-Accelerate-Generative-AI-and-ML-Development
It seems that Huggingface has a lot of bargaining power.
https://www.googlecloudpresscorner.com/2024-01-25-Google-Cloud-and-Hugging-Face-Announce-Strategic-Partnership-to-Accelerate-Generative-AI-and-ML-Development