DeepSeek is a Chinese free artificial intelligence that develops open-source large language models (LLMs). Located in Hangzhou, Zhejiang. Liang Wenfeng was the founder of DeepSeek.
On 2 November 2023, DeepSeek released its first series of models, DeepSeek-Coder, which is available for free to both researchers and commercial users. DeepSeek LLM is detailed below. The series includes 8 models, 4 pretrained (Base) and 4 instruction-finetuned (Instruct). They all have 16K context lengths. They were trained on clusters of A100 and H800 Nvidia GPUs, connected by InfiniBand, NVLink, and NVSwitch.
DeepSeek's optimization of limited resources has highlighted potential limits of U.S. sanctions on China's A.I. development, which include export restrictions on advanced A.I. chips to China. DeepSeek’s models appear to be only a fraction of what is required for OpenAI or Meta Platforms’ best products. Both DeepSeek and OpenAI have developed advanced large language models (LLMs), but they differ in their approaches, costs, and philosophies.