Building on the framework MediaTek has established for Llama 2 and Llama 3 models, we are once again working to combine NPU hardware acceleration with software tools to officially support Llama 3.2 on Dimensity platforms, including the upcoming Dimensity 9400 and other Generative AI-enabled platforms.
The Llama 3.2 release includes several new, updated and highly differentiated models across a spectrum of sizes and capabilities, as well as robust system-level safety support, including image input guardrails. The 1B, 3B and 11B models support on-device use cases like knowledge retrieval and summarization, instruction following, and rewriting tasks running locally at the edge. Running Llama 3.2 on-device on MediaTek platforms provides many benefits for developers and users, including faster, lower-latency responses and reduced power consumption. Because the smaller Llama models use less memory, they make on-device Generative AI solutions practical and give a smoother user experience.
Developers will be able to use Llama 3.2 through MediaTek’s NeuroPilot SDK, a toolkit that enables and optimizes on-device Generative AI inference across our diverse product portfolio, including mobile platforms and edge-AI-capable devices.