Release notes
Platform Service Adjustment Notice
As of now, the Pro/deepseek-ai/DeepSeek-V3 and deepseek-ai/DeepSeek-V3 models have been updated to the latest 0324 version. You can still use the older version via Pro/deepseek-ai/DeepSeek-V3-1226 to facilitate a smoother transition of your business.
Platform Service Adjustment Notice
SiliconCloud is beginning its update of the DeepSeek-V3 model. The deepseek-ai/DeepSeek-V3 and Pro/deepseek-ai/DeepSeek-V3 models will be progressively updated to the latest 0324 version; once the update is complete, both models will be on the 0324 version. If needed, you can still use the old version via deepseek-ai/DeepSeek-V3-1226 until April 30, 2025, to facilitate a smoother business transition.
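For reference, a minimal sketch of pinning the older snapshot during the transition period, assuming an OpenAI-compatible chat completions call (the API key is a placeholder and the /v1 path is an assumption, not something stated in this notice):

```python
from openai import OpenAI

# Placeholder credentials; the /v1 path follows the common OpenAI-compatible convention.
client = OpenAI(api_key="YOUR_API_KEY", base_url="https://api.siliconflow.cn/v1")

# Pin the pre-update snapshot explicitly; the unsuffixed name now resolves to 0324.
response = client.chat.completions.create(
    model="deepseek-ai/DeepSeek-V3-1226",
    messages=[{"role": "user", "content": "Hello"}],
)
print(response.choices[0].message.content)
```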
Platform Service Adjustment Notice
To better serve developers worldwide, SiliconCloud will soon launch an international site and gradually open multiple service regions.
As part of this adjustment, the existing api.siliconflow.com API endpoint will be phased out at an appropriate time. Please switch to api.siliconflow.cn as soon as possible to continue using the service.
We have already configured Global Traffic Manager (GTM) for the .cn endpoint to provide the same global access experience as the current .com endpoint. You only need to change the base URL of your API requests to api.siliconflow.cn.
We recommend that you complete the migration by the end of this month (March 31). If you have any questions, please contact us at any time.
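For illustration, a minimal migration sketch assuming an OpenAI-compatible client; only the base URL changes, and the /v1 path suffix is an assumption rather than something stated in this notice:

```python
from openai import OpenAI

# Before (endpoint to be phased out):
# client = OpenAI(api_key="YOUR_API_KEY", base_url="https://api.siliconflow.com/v1")

# After: point the client at the .cn endpoint; no other request changes are needed.
client = OpenAI(api_key="YOUR_API_KEY", base_url="https://api.siliconflow.cn/v1")
```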
Platform Service Adjustment Notice
To continuously improve user experience, the Rate Limits policy is being adjusted as follows:
- The RPH and RPD rate limits for deepseek-ai/DeepSeek-R1 and deepseek-ai/DeepSeek-V3 have been removed.
This policy may be adjusted at any time as traffic and load change, and SiliconFlow reserves the right of final interpretation.
Platform Service Adjustment Notice
1. Model Offline Notice
To further optimize resource allocation and provide more advanced, high-quality, and compliant technical services, the platform will shut down certain models on March 6, 2025. The specific list of models involved is as follows:
- Chat models:
- AIDC-AI/Marco-o1
- meta-llama/Meta-Llama-3.1-8B-Instruct
- Pro/meta-llama/Meta-Llama-3.1-8B-Instruct
- meta-llama/Meta-Llama-3.1-70B-Instruct
- meta-llama/Meta-Llama-3.1-405B-Instruct
- meta-llama/Llama-3.3-70B-Instruct
- Image generation models:
- black-forest-labs/FLUX.1-schnell
- Pro/black-forest-labs/FLUX.1-schnell
- black-forest-labs/FLUX.1-dev
- black-forest-labs/FLUX.1-pro
- stabilityai/stable-diffusion-xl-base-1.0
- stabilityai/stable-diffusion-3-5-large
- stabilityai/stable-diffusion-3-5-large-turbo
- stabilityai/stable-diffusion-2-1
- deepseek-ai/Janus-Pro-7B
- Voice models:
- fishaudio/fish-speech-1.5
- FunAudioLLM/SenseVoiceSmall
- FunAudioLLM/CosyVoice2-0.5B
- fishaudio/fish-speech-1.4
- RVC-Boss/GPT-SoVITS
- Video models:
- Lightricks/LTX-Video
- genmo/mochi-1-preview
Platform Service Adjustment Notice
To ensure the quality of platform services and the rational allocation of resources, the following adjustments to Rate Limits policies are now in effect:
Adjustments:
1. New RPH Limit (Requests Per Hour)
- Model Scope: deepseek-ai/DeepSeek-R1, deepseek-ai/DeepSeek-V3
- Applicable Users: All users
- Limit Standard: 30 requests/hour
2. New RPD Limit (Requests Per Day)
- Model Scope: deepseek-ai/DeepSeek-R1, deepseek-ai/DeepSeek-V3
- Applicable Users: Users who have not completed real-name authentication
- Limit Standard: 100 requests/day
Please note that these policies may be adjusted at any time based on traffic and load changes. SiliconFlow reserves the right of final interpretation of these policies.
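If your traffic approaches these limits, a simple client-side guard can smooth over rejected requests. The sketch below assumes the API signals a rate limit with an HTTP 429 status (a common convention, not confirmed by this notice) and retries with exponential backoff:

```python
import time

import requests

# Assumed OpenAI-compatible chat completions path on the platform endpoint.
API_URL = "https://api.siliconflow.cn/v1/chat/completions"


def post_with_backoff(payload: dict, headers: dict, max_retries: int = 5) -> requests.Response:
    """Send a request, retrying with exponential backoff when rate-limited (HTTP 429 assumed)."""
    delay = 2.0
    for _ in range(max_retries):
        resp = requests.post(API_URL, json=payload, headers=headers, timeout=60)
        if resp.status_code != 429:
            return resp
        time.sleep(delay)  # wait before retrying so the hourly/daily budget can recover
        delay *= 2
    return resp
```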
1. Model Offline Notice
To provide more stable, high-quality, and sustainable services, the following models will be offline on February 27, 2025:
- 01-ai/Yi-1.5-34B-Chat-16K
- 01-ai/Yi-1.5-6B-Chat
- 01-ai/Yi-1.5-9B-Chat-16K
- stabilityai/stable-diffusion-3-medium
- google/gemma-2-27b-it
- google/gemma-2-9b-it
- Pro/google/gemma-2-9b-it
If you are using any of these models, it is recommended to migrate to other models available on the platform as soon as possible.
Platform Service Adjustment Notice
DeepSeek-V3 model prices have been restored to the original prices as of 00:00 Beijing time on February 9, 2025.
Specific prices:
- Input: ¥2/M Tokens
- Output: ¥8/M Tokens
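As a worked example of the restored pricing, here is a small cost estimate based on the token counts returned in a chat completion's usage object (field names follow the OpenAI-compatible convention):

```python
# Restored DeepSeek-V3 prices, in ¥ per million tokens.
INPUT_PRICE_PER_M = 2.0
OUTPUT_PRICE_PER_M = 8.0


def estimate_cost(prompt_tokens: int, completion_tokens: int) -> float:
    """Return the cost in yuan for one request, given its token usage."""
    return (prompt_tokens * INPUT_PRICE_PER_M + completion_tokens * OUTPUT_PRICE_PER_M) / 1_000_000


# Example: 120,000 input tokens and 30,000 output tokens -> ¥0.24 + ¥0.24 = ¥0.48.
print(estimate_cost(120_000, 30_000))
```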
Inference Model Output Adjustment Notice
The reasoning chain produced by inference models will be split out of the content field into a separate reasoning_content field. This change is compatible with the OpenAI and DeepSeek API specifications, making it easier for frameworks and upper-layer applications to trim the reasoning chain in multi-round dialogues. For more details, see Inference Model (DeepSeek-R1) Usage.
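A minimal sketch of reading the two fields, assuming the OpenAI-compatible Python SDK (which passes the extra reasoning_content field through on the message object); only the final answer should be fed back into the next round:

```python
from openai import OpenAI

# Placeholder key; the /v1 path is an assumption based on common OpenAI-compatible setups.
client = OpenAI(api_key="YOUR_API_KEY", base_url="https://api.siliconflow.cn/v1")

messages = [{"role": "user", "content": "Why is the sky blue?"}]
response = client.chat.completions.create(model="deepseek-ai/DeepSeek-R1", messages=messages)

message = response.choices[0].message
print("Reasoning chain:", message.reasoning_content)  # separated chain of thought
print("Final answer:", message.content)

# In multi-round dialogue, append only the final answer, not the reasoning chain.
messages.append({"role": "assistant", "content": message.content})
```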
Platform Service Adjustment Notice
Support for deepseek-ai/DeepSeek-R1 and deepseek-ai/DeepSeek-V3 Models
The specific pricing is as follows:
deepseek-ai/DeepSeek-R1
- Input: ¥4 / M Tokens
- Output: ¥16 / M Tokens
deepseek-ai/DeepSeek-V3
- From February 1, 2025 to 24:00 Beijing time on February 8, 2025, a limited-time discount applies: Input ¥1 / M Tokens (original price ¥2), Output ¥2 / M Tokens (original price ¥8). The original price will be restored from 00:00 on February 9, 2025.
Platform Service Adjustment Notice
Image and Video URL Validity Period Adjusted to 1 Hour
To continue providing you with more advanced and high-quality services, the validity period of image and video URLs generated by large models will be adjusted to 1 hour starting from January 20, 2025.
If you are currently using the image and video generation service, please make sure to back up the files in time to avoid any business disruptions due to URL expiration.
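For example, a minimal backup sketch that downloads a generated file before the one-hour window closes (the URL and filename below are placeholders):

```python
import requests

# Placeholder URL returned by an image/video generation request; valid for 1 hour.
url = "https://example.com/generated/output.png"

resp = requests.get(url, timeout=60)
resp.raise_for_status()
with open("output.png", "wb") as f:
    f.write(resp.content)  # persist locally before the URL expires
```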
Platform Service Adjustment Notice
Charging Will Begin for the LTX-Video Model
To continue providing you with more advanced and high-quality services, the platform will start charging for video generation requests made with the Lightricks/LTX-Video model from January 6, 2025, at a rate of ¥0.14 per video.
Platform Service Adjustment Notice
1. Model Offline Notice
To provide more stable, high-quality, and sustainable services, the following models will be offline on December 19, 2024:
- deepseek-ai/DeepSeek-V2-Chat
- Qwen/Qwen2-72B-Instruct
- Vendor-A/Qwen/Qwen2-72B-Instruct
- OpenGVLab/InternVL2-Llama3-76B
If you are using any of these models, it is recommended to migrate to other models available on the platform as soon as possible.
Platform Service Adjustment Notice
1. Model Offline Notice
To provide more stable, high-quality, and sustainable services, the following models will be offline on December 13, 2024:
If you are using any of these models, it is recommended to migrate to other models available on the platform as soon as possible.
Platform Service Adjustment Notice
1. Model Offline Notice
To provide more stable, high-quality, and sustainable services, the following models will be offline on November 22, 2024:
- deepseek-ai/DeepSeek-Coder-V2-Instruct
- Qwen/Qwen2-57B-A14B-Instruct
- Pro/internlm/internlm2_5-7b-chat
- Pro/THUDM/chatglm3-6b
- Pro/01-ai/Yi-1.5-9B-Chat-16K
- Pro/01-ai/Yi-1.5-6B-Chat
If you are using any of these models, it is recommended to migrate to other models available on the platform as soon as possible.
2. Email Login Method Update
To further enhance service experience, the platform will update the login method starting from November 22, 2024: from the original “email account + password” method to an “email account + verification code” method.
3. New Overseas API Endpoint
A new endpoint for overseas users has been added: https://api-st.siliconflow.cn. If you encounter network connection issues while using the original endpoint https://api.siliconflow.cn, it is recommended to switch to the new endpoint.
Partial Model Pricing Adjustment Notice
To provide more stable, high-quality, and sustainable services, the Vendor-A/Qwen/Qwen2-72B-Instruct model, which was previously offered for free, will become a paid model starting October 17, 2024. The pricing details are as follows:
- Limited-time discount price: ¥1.00 / M tokens
- Original price: ¥4.13 / M tokens (the original price will be restored at a later date)