{"schema_version":"onlylabs.public_analysis.v1","url":"https://onlylabs.fyi/analysis/meta-ai","json_url":"https://onlylabs.fyi/analysis/meta-ai/analysis.json","evidence_json_url":"https://onlylabs.fyi/analysis/meta-ai/evidence.json","generated_at":"2026-06-11T18:04:57.749Z","analysis":{"org_slug":"meta-ai","url":"https://onlylabs.fyi/analysis/meta-ai","json_url":"https://onlylabs.fyi/analysis/meta-ai/analysis.json","evidence_json_url":"https://onlylabs.fyi/analysis/meta-ai/evidence.json","dossier_url":"https://onlylabs.fyi/labs/meta-ai","org":{"slug":"meta-ai","name":"Meta AI (Llama)","category":"frontier-lab","category_label":"Frontier lab","homepage_url":"https://ai.meta.com"},"title":"Meta AI (Llama) analysis","summary":"Meta AI is the open-weight anchor of the frontier-model field: it ships the Llama family under permissive licenses and lets the ecosystem do distribution, while pivoting its newest generation (Llama 4) to mixture-of-experts. Alongside the models it is building out the surrounding tooling — a hosted Llama API (Python/TypeScript SDKs), the PurpleLlama/Llama-Guard safety stack, and developer cookbooks — and its public…","markdown":"## Thesis\n\nMeta AI is the open-weight anchor of the frontier-model field: it ships the Llama family under permissive licenses and lets the ecosystem do distribution, while pivoting its newest generation (Llama 4) to mixture-of-experts. Alongside the models it is building out the surrounding tooling — a hosted Llama API (Python/TypeScript SDKs), the PurpleLlama/Llama-Guard safety stack, and developer cookbooks — and its public engineering writing is dominated by AI infrastructure and applied-LLM systems work rather than model announcements.\n\n## Shipping\n\nThe footprint is led by the Llama checkpoints on Hugging Face. The most-pulled by far is [`meta-llama/Llama-3.1-8B-Instruct`](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct) at **11,216,853** 30-day downloads (6,013 likes), followed by the small Llama 3.2 line — [`Llama-3.2-1B-Instruct`](https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct) at **8,117,344**, [`Llama-3.2-1B`](https://huggingface.co/meta-llama/Llama-3.2-1B) at **2,338,719**, and [`Llama-3.2-3B-Instruct`](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct) at **1,693,307**. The flagship dense model [`Llama-3.3-70B-Instruct`](https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct) draws **787,281** downloads (2,805 likes), and the 405B [`Llama-3.1-405B-Instruct`](https://huggingface.co/meta-llama/Llama-3.1-405B-Instruct) sits at **219,986**.\n\nThe newest generation is MoE: [`Llama-4-Scout-17B-16E-Instruct`](https://huggingface.co/meta-llama/Llama-4-Scout-17B-16E-Instruct) (108B total params, 16 experts) at **452,362** downloads and [`Llama-4-Maverick-17B-128E-Instruct`](https://huggingface.co/meta-llama/Llama-4-Maverick-17B-128E-Instruct) (401B total, 128 experts) at **33,079**. Multimodal shows up via [`Llama-3.2-11B-Vision-Instruct`](https://huggingface.co/meta-llama/Llama-3.2-11B-Vision-Instruct) (**173,277**). A notable share of the catalog is safety tooling: [`Prompt-Guard-86M`](https://huggingface.co/meta-llama/Prompt-Guard-86M) (**697,663**), [`Llama-Guard-4-12B`](https://huggingface.co/meta-llama/Llama-Guard-4-12B) (**152,961**), [`Llama-Prompt-Guard-2-86M`](https://huggingface.co/meta-llama/Llama-Prompt-Guard-2-86M) (**136,048**), plus the [`Llama-Guard-3-8B`](https://huggingface.co/meta-llama/Llama-Guard-3-8B) and [`Llama-Guard-3-1B`](https://huggingface.co/meta-llama/Llama-Guard-3-1B) classifiers.\n\nOn GitHub the legacy [`meta-llama/llama`](https://github.com/meta-llama/llama) repo still leads at **59,454** stars, with [`llama3`](https://github.com/meta-llama/llama3) at 29,287, [`llama-cookbook`](https://github.com/meta-llama/llama-cookbook) at 18,346, [`codellama`](https://github.com/meta-llama/codellama) at 16,314, [`llama-models`](https://github.com/meta-llama/llama-models) at 7,625, and the safety repo [`PurpleLlama`](https://github.com/meta-llama/PurpleLlama) at 4,210. Recent release activity is concentrated on the hosted API surface: [`llama-api-python v0.6.0`](https://github.com/meta-llama/llama-api-python/releases/tag/v0.6.0) and [`llama-api-typescript v0.3.0`](https://github.com/meta-llama/llama-api-typescript/releases/tag/v0.3.0) are the latest of a steady cadence of SDK point releases, alongside [`llama-verifications`](https://github.com/meta-llama/llama-verifications/releases/tag/v0.1.20.1.2rc2). Newer data/ops repos — [`synthetic-data-kit`](https://github.com/meta-llama/synthetic-data-kit) (1,597 stars) and [`prompt-ops`](https://github.com/meta-llama/prompt-ops) (820) — round out the developer-tooling push.\n\n## Research themes\n\nMeta's captured engineering writing skews toward AI *infrastructure and applied LLM systems* over model releases:\n\n- **LLM inference and GPU systems at scale** — [\"Scaling LLM Inference: Innovations in Tensor Parallelism, Context Parallelism, and Expert Parallelism\"](https://engineering.fb.com/2025/10/17/ai-research/scaling-llm-inference-innovations-tensor-parallelism-context-parallelism-expert-parallelism/) (tied to the Meta AI App), [\"RCCLX: Innovating GPU Communications on AMD Platforms\"](https://engineering.fb.com/2026/02/24/data-center-engineering/rrcclx-innovating-gpu-communications-amd-platforms-meta/) (open-sourced, AMD/Torchcomms), and [\"Meta's Infrastructure Evolution and the Advent of AI\"](https://engineering.fb.com/2025/09/29/data-infrastructure/metas-infrastructure-evolution-and-the-advent-of-ai/).\n- **Vector search** — [\"Accelerating GPU indexes in Faiss with NVIDIA cuVS\"](https://engineering.fb.com/2025/05/08/data-infrastructure/accelerating-gpu-indexes-in-faiss-with-nvidia-cuvs/), reporting up to 4.7x faster IVF build and 8.1x lower search latency in Faiss v1.10.\n- **LLMs applied to software engineering** — [\"Diff Risk Score: AI-driven risk-aware software development\"](https://engineering.fb.com/2025/08/06/developer-tools/diff-risk-score-drs-ai-risk-aware-software-development-meta/) (a fine-tuned Llama predicting production-incident risk) and [\"LLMs Are the Key to Mutation Testing and Better Compliance\"](https://engineering.fb.com/2025/09/30/security/llms-are-the-key-to-mutation-testing-and-better-compliance/) (the ACH compliance-hardening tool).\n- **AI for science and the physical/AR-VR world** — [\"Using AI to make lower-carbon, faster-curing concrete\"](https://engineering.fb.com/2025/07/16/data-center-engineering/ai-make-lower-carbon-faster-curing-concrete/) (Bayesian optimization via BoTorch/Ax), [\"Meta 3D AssetGen: Generating 3D Worlds With AI\"](https://engineering.fb.com/2025/09/29/virtual-reality/assetgen-generating-3d-worlds-with-ai/), and [\"Building a human-computer interface for everyone\"](https://engineering.fb.com/2025/08/04/virtual-reality/building-a-human-computer-interface-for-everyone-meta-tech-podcast/) (Reality Labs sEMG wristband).\n\n## Hiring & scaling\n\nThe 15 captured roles read as broad product-and-platform scaling rather than a pure research build-out. Engineering is the largest bucket — multiple **Software Engineer** openings including Product, **Infrastructure**, and **AR/VR** (Redmond, WA), plus a **Machine Learning Engineer** (Palo Alto) — with research demand showing in two **Research Scientist, AI** posts (New York and Palo Alto). Supporting functions span the full product org: **Data Scientist / Data Scientist, Analytics** (Menlo Park and New York), **Technical Program Manager** (Seattle), **Product Manager / Product Designer / Product Marketing Manager**, and a **Security Engineer** (Menlo Park). Geographically the center of gravity is Menlo Park, with secondary clusters in New York, the Bay Area, and the Seattle/Redmond corridor — consistent with both the AI App / infrastructure work and the Reality Labs AR/VR investment surfaced in the writing.\n\n## Traction highlights\n\n- **Most-downloaded model:** [`Llama-3.1-8B-Instruct`](https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct) at **11.2M** 30-day downloads, with the Llama 3.2 1B/3B small models close behind (8.1M / 1.7M+).\n- **Most-starred repo:** [`meta-llama/llama`](https://github.com/meta-llama/llama) at **59,454** stars, followed by [`llama3`](https://github.com/meta-llama/llama3) (29,287) and [`llama-cookbook`](https://github.com/meta-llama/llama-cookbook) (18,346).\n- **Hacker News:** captured traction is thin — [\"Meta's Infrastructure Evolution and the Advent of AI\"](https://engineering.fb.com/2025/09/29/data-infrastructure/metas-infrastructure-evolution-and-the-advent-of-ai/) reached only **4 points / 0 comments** and [\"LLMs Are the Key to Mutation Testing and Better Compliance\"](https://engineering.fb.com/2025/09/30/security/llms-are-the-key-to-mutation-testing-and-better-compliance/) **2 points / 1 comment**. The distribution story is on Hugging Face and GitHub, not HN.\n\n## Sources\n\n- https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct\n- https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct\n- https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct\n- https://huggingface.co/meta-llama/Llama-3.1-405B-Instruct\n- https://huggingface.co/meta-llama/Llama-4-Scout-17B-16E-Instruct\n- https://huggingface.co/meta-llama/Llama-4-Maverick-17B-128E-Instruct\n- https://huggingface.co/meta-llama/Llama-3.2-11B-Vision-Instruct\n- https://huggingface.co/meta-llama/Prompt-Guard-86M\n- https://huggingface.co/meta-llama/Llama-Guard-4-12B\n- https://github.com/meta-llama/llama\n- https://github.com/meta-llama/llama3\n- https://github.com/meta-llama/llama-cookbook\n- https://github.com/meta-llama/PurpleLlama\n- https://github.com/meta-llama/llama-api-python/releases/tag/v0.6.0\n- https://github.com/meta-llama/llama-api-typescript/releases/tag/v0.3.0\n- https://engineering.fb.com/2025/10/17/ai-research/scaling-llm-inference-innovations-tensor-parallelism-context-parallelism-expert-parallelism/\n- https://engineering.fb.com/2025/05/08/data-infrastructure/accelerating-gpu-indexes-in-faiss-with-nvidia-cuvs/\n- https://engineering.fb.com/2025/08/06/developer-tools/diff-risk-score-drs-ai-risk-aware-software-development-meta/\n- https://engineering.fb.com/2025/09/30/security/llms-are-the-key-to-mutation-testing-and-better-compliance/\n- https://engineering.fb.com/2025/09/29/data-infrastructure/metas-infrastructure-evolution-and-the-advent-of-ai/\n- https://engineering.fb.com/2025/09/29/virtual-reality/assetgen-generating-3d-worlds-with-ai/\n- https://engineering.fb.com/2026/02/24/data-center-engineering/rrcclx-innovating-gpu-communications-amd-platforms-meta/\n- https://ai.meta.com","generated_at":"2026-06-08T15:59:08.887+00:00","citations":[{"url":"https://huggingface.co/meta-llama/Llama-3.1-8B-Instruct","path":null,"label":"meta-llama/Llama-3.1-8B-Instruct","type":"external"},{"url":"https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct","path":null,"label":"meta-llama/Llama-3.2-1B-Instruct","type":"external"},{"url":"https://huggingface.co/meta-llama/Llama-3.2-1B","path":null,"label":"meta-llama/Llama-3.2-1B","type":"external"},{"url":"https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct","path":null,"label":"meta-llama/Llama-3.2-3B-Instruct","type":"external"},{"url":"https://huggingface.co/meta-llama/Llama-3.3-70B-Instruct","path":null,"label":"meta-llama/Llama-3.3-70B-Instruct","type":"external"},{"url":"https://huggingface.co/meta-llama/Llama-3.1-405B-Instruct","path":null,"label":"meta-llama/Llama-3.1-405B-Instruct","type":"external"},{"url":"https://huggingface.co/meta-llama/Llama-4-Scout-17B-16E-Instruct","path":null,"label":"meta-llama/Llama-4-Scout-17B-16E-Instruct","type":"external"},{"url":"https://huggingface.co/meta-llama/Llama-4-Maverick-17B-128E-Instruct","path":null,"label":"meta-llama/Llama-4-Maverick-17B-128E-Instruct","type":"external"},{"url":"https://huggingface.co/meta-llama/Llama-3.2-11B-Vision-Instruct","path":null,"label":"meta-llama/Llama-3.2-11B-Vision-Instruct","type":"external"},{"url":"https://huggingface.co/meta-llama/Prompt-Guard-86M","path":null,"label":"meta-llama/Prompt-Guard-86M","type":"external"},{"url":"https://huggingface.co/meta-llama/Llama-Guard-4-12B","path":null,"label":"meta-llama/Llama-Guard-4-12B","type":"external"},{"url":"https://huggingface.co/meta-llama/Llama-Prompt-Guard-2-86M","path":null,"label":"meta-llama/Llama-Prompt-Guard-2-86M","type":"external"},{"url":"https://huggingface.co/meta-llama/Llama-Guard-3-8B","path":null,"label":"meta-llama/Llama-Guard-3-8B","type":"external"},{"url":"https://huggingface.co/meta-llama/Llama-Guard-3-1B","path":null,"label":"meta-llama/Llama-Guard-3-1B","type":"external"},{"url":"https://github.com/meta-llama/llama","path":null,"label":"meta-llama/llama","type":"external"},{"url":"https://github.com/meta-llama/llama3","path":null,"label":"meta-llama/llama3","type":"external"},{"url":"https://github.com/meta-llama/llama-cookbook","path":null,"label":"meta-llama/llama-cookbook","type":"external"},{"url":"https://github.com/meta-llama/codellama","path":null,"label":"meta-llama/codellama","type":"external"},{"url":"https://github.com/meta-llama/llama-models","path":null,"label":"meta-llama/llama-models","type":"external"},{"url":"https://github.com/meta-llama/PurpleLlama","path":null,"label":"meta-llama/PurpleLlama","type":"external"},{"url":"https://github.com/meta-llama/llama-api-python/releases/tag/v0.6.0","path":null,"label":"meta-llama/llama-api-python","type":"external"},{"url":"https://github.com/meta-llama/llama-api-typescript/releases/tag/v0.3.0","path":null,"label":"meta-llama/llama-api-typescript","type":"external"},{"url":"https://github.com/meta-llama/llama-verifications/releases/tag/v0.1.20.1.2rc2","path":null,"label":"meta-llama/llama-verifications","type":"external"},{"url":"https://github.com/meta-llama/synthetic-data-kit","path":null,"label":"meta-llama/synthetic-data-kit","type":"external"},{"url":"https://github.com/meta-llama/prompt-ops","path":null,"label":"meta-llama/prompt-ops","type":"external"},{"url":"https://engineering.fb.com/2025/10/17/ai-research/scaling-llm-inference-innovations-tensor-parallelism-context-parallelism-expert-parallelism/","path":null,"label":"engineering.fb.com/2025","type":"external"},{"url":"https://engineering.fb.com/2026/02/24/data-center-engineering/rrcclx-innovating-gpu-communications-amd-platforms-meta/","path":null,"label":"engineering.fb.com/2026","type":"external"},{"url":"https://engineering.fb.com/2025/09/29/data-infrastructure/metas-infrastructure-evolution-and-the-advent-of-ai/","path":null,"label":"engineering.fb.com/2025","type":"external"},{"url":"https://engineering.fb.com/2025/05/08/data-infrastructure/accelerating-gpu-indexes-in-faiss-with-nvidia-cuvs/","path":null,"label":"engineering.fb.com/2025","type":"external"},{"url":"https://engineering.fb.com/2025/08/06/developer-tools/diff-risk-score-drs-ai-risk-aware-software-development-meta/","path":null,"label":"engineering.fb.com/2025","type":"external"},{"url":"https://engineering.fb.com/2025/09/30/security/llms-are-the-key-to-mutation-testing-and-better-compliance/","path":null,"label":"engineering.fb.com/2025","type":"external"},{"url":"https://engineering.fb.com/2025/07/16/data-center-engineering/ai-make-lower-carbon-faster-curing-concrete/","path":null,"label":"engineering.fb.com/2025","type":"external"},{"url":"https://engineering.fb.com/2025/09/29/virtual-reality/assetgen-generating-3d-worlds-with-ai/","path":null,"label":"engineering.fb.com/2025","type":"external"},{"url":"https://engineering.fb.com/2025/08/04/virtual-reality/building-a-human-computer-interface-for-everyone-meta-tech-podcast/","path":null,"label":"engineering.fb.com/2025","type":"external"},{"url":"https://ai.meta.com","path":null,"label":"ai.meta.com","type":"external"}],"provenance":{"provider":null,"model":null,"workflow":"synthesize-analyses","agent":null},"evidence":{"total":null,"pages":null,"events":null,"web":null,"signal_desks":null,"data_radar_lanes":null,"data_radar_matches":null}},"signal_counts":{"total":87,"model_released":30,"release":20,"repo_new":12,"repo_forked":0,"post_published":9,"job_opened":16}}