Sitemap - 2024 - MLOps Newsletter
Google builds UniAR, AirbnB uses ViTs!
Uber's in-house and open-source LLM Training Stack
Alibaba Foundation Models: QWEN Series
Pruning Aware Training(PAT) in LLMs
Eureka: OSS Framework to evaluate LLMs
State Space Sequence Models over Transformers?
Scaling and Reliability Challenges of LLama3
Mechanistic Interpretability, Linear Representation Hypothesis, Sparse AutoEncoders and All That
Hallucination Attenuated Language and Vision Assistant(Halva) from Google
Quantization Aware Training in PyTorch
Llama 3.1 launched and it is gooooood!
Gemini to migrate code, Gemini to do Automatic Speech Recognition
On the Factory Floor: ML Engineering for Industrial-Scale Ads Recommendation Models from Google
Kafka Tiered Storage from Uber
How to build an infrastructure from scratch to train a 70B Model?
What if LLM is the ultimate data janitor
Apple announces Apple Intelligence
Foundational Models and Compute Trends
ThunderKittens to make the GPUS go brr
Compute and the role it plays in AI
Personalizing Heart Rate Prediction
Pathscopes: Inspect Hidden Representation of Neural Networks!
Llama3 is out and it is awesome!
Pinterest's Text to SQL system through LLMs!
Pinterest introduces LinkSage, Google combines Neural Networks with Bayesian theory
Representation Engineering for Control Vector
Google open-sources Gemma(2B, 7B parameter models)
Compound AI Systems over Vanilla LLMs
Small Language Models(SLM): Phi-2!
Graph Neural Networks in Tensorflow
Exphormer(Graph Neural Networks)
Google announces AI system for diagnostic medical reasoning and conversation