@IslaHF and I had a backchannel conversation about DeepSeek:
DeepSeek-V3 is a super nice open-ish chat model, and DeepSeek-R1 is a really good reasoning model. The R1 repository and weights are openly licensed, but the model is not reproducible because pieces of the pipeline are missing, although efforts are underway to address that.
There are also free-as-in-beer (but not open) options, e.g. ChatGPT, Claude and Google's Gemini (the Flash models are free of charge and super nice, too).
DeepSeek is, ironically, much more open than "Open"AI in areas like licensing and releasing the underlying research. However, by using DeepSeek through their chat interface or their official API, people are sending their data to China, which could have geopolitical implications. Also, it refuses to talk about Taiwan’s independence and other topics sensitive to China, although the refusal rate is reportedly lower when the model is run locally. So there’s censorship as well, though of a different kind from Western censorship.
What are your thoughts on using models like this? What are your considerations when choosing a chatbot or an API?
Super curious to hear people’s thoughts, including @moodler, who’s been doing a lot of thinking about AI recently.