Voxtral models are state-of-the-art speech understanding models, which are available in two sizes — a 24B variant for production-scale applications and a 3B variant for local and edge deployments. Both versions are released under the Apache 2.0 license.