Sitemap - 2024 - MLOps Newsletter

Speculative Decoding for LLM

Google builds UniAR, Airbnb uses ViTs!

Uber's in-house and open-source LLM Training Stack

Alibaba Foundation Models: QWEN Series

Pruning Aware Training (PAT) in LLMs

Eureka: OSS Framework to evaluate LLMs

State Space Sequence Models over Transformers?

DataGemma through RIG and RAG

Scaling and Reliability Challenges of Llama 3

Mechanistic Interpretability, Linear Representation Hypothesis, Sparse AutoEncoders and All That

Hallucination Attenuated Language and Vision Assistant (HALVA) from Google

Quantization Aware Training in PyTorch

Llama 3.1 launched and it is gooooood!

Gemini to migrate code, Gemini to do Automatic Speech Recognition

On the Factory Floor: ML Engineering for Industrial-Scale Ads Recommendation Models from Google

Missing torch.compile Manual

Kafka Tiered Storage from Uber

How to build an infrastructure from scratch to train a 70B Model?

What if an LLM is the ultimate data janitor?

Apple announces Apple Intelligence

Foundational Models and Compute Trends

ThunderKittens to make the GPUs go brr

GPT-4o

Compute and the role it plays in AI

Personalizing Heart Rate Prediction

Patchscopes: Inspect Hidden Representations of Neural Networks!

Llama3 is out and it is awesome!

Pinterest's Text to SQL system through LLMs!

DSPy through a RAG System

Pinterest introduces LinkSage, Google combines Neural Networks with Bayesian theory

Mamba and DSPy explained!

X.ai releases Grok-1!

Representation Engineering for Control Vector

Google open-sources Gemma (2B, 7B parameter models)

Compound AI Systems over Vanilla LLMs

Small Language Models (SLM): Phi-2!

OpenAI Releases Sora!

Graph Neural Networks in TensorFlow

Exphormer (Graph Neural Networks)

Modular Deep Learning

Google announces AI system for diagnostic medical reasoning and conversation

Monarch Matrices (M2) instead of Transformers?

What happened in 2023