DeepSeek has finally released its much-anticipated next-generation open-source foundational AI model, V4, which it said was competitive with leading US closed-source models from the likes of OpenAI and Google DeepMind.
DeepSeek released two versions of the model on Friday, with the V4-pro model boasting 1.6 trillion parameters, making it the Hangzhou start-up’s biggest ever model, while the smaller V4-flash model has 284 billion parameters.
Both models have a context window of 1 million tokens, a critical feature that determines the amount of information an artificial intelligence system is able to process, which DeepSeek said was achieved with “world-leading” cost efficiency.
More to follow…