Products Articles About

From deep
research to
real-world
impact.

从深度研究
到现实落地

We build high-performance AI systems, publish cutting-edge research, and ship real products — from wafer-scale inference engines to intelligent rental agents. Partnering with Tencent, NHS, and Harvard to push what AI can do.

Our Products

View all
WaferLLM

WaferLLM

WaferLLM is the first wafer-scale LLM inference system, designed for a next-generation AI accelerator with hundreds of thousands of cores, tens of gigabytes of distributed on-chip memory, and tens of PB/s on-chip bandwidth. It introduces novel parallel strategies and kernel implementations that achieve orders-of-magnitude performance improvements over GPU-based systems.

Learn more
BioVLM 8B

BioVLM

BioVLM 8B is a cost-efficient scientific domain vision-language model that surpasses GPT-5.2 on biological research tasks. Developed in collaboration with Harvard Medical School and Edinburgh's Roslin Institute, it uses automated rich-text data synthesis from raw PDF papers to train a domain-specialized VLM — with the entire pipeline costing less than $200.

Learn more
AI4Whisky

AI4Whisky

AI4Whisky is a collaboration with the Scottish Government, the University of Edinburgh, and ICBD to help whisky distilleries calculate their carbon emissions and receive tailored reduction recommendations. Scotland's whisky industry generated £710 million in added value in 2022, and the sector aims for net-zero emissions by 2040 — but most small and medium distilleries lack the resources for proper carbon footprint assessment.

Learn more

Systems that scale, research that leads, products that ship.