If you are using the requests library for non-OpenAI scenarios, such as using the SiliconCloud API, you need to ensure that both the payload and the request parameters are set to stream mode.
By default, the processing mechanism of the curl command buffers the output stream, so even if the server sends data in chunks, you will only see the content after the buffer is filled or the connection is closed. Passing the -N (or --no-buffer) option disables this buffering, allowing chunks of data to be printed to the terminal immediately, thus achieving streaming output.