Skip to main content
DeepStudent is an open-source, AI-native, local-first learning system designed to build a complete learning loop from input to internalization. Powered by the Chat V2 dialogue engine and driven by a skill-based architecture, it uses a virtual file system as a unified data foundation to integrate intelligent conversation, knowledge management, Anki card generation, deep research, academic writing, knowledge mapping, question bank practice, translation workspace, essay grading, and more into a single platform. All data is stored locally (SQLite + LanceDB + Blob), ensuring security and full control. Through deep integration with SiliconFlow, a single SiliconFlow API Key is all you need to instantly unlock a complete capability matrix—including chat, reasoning, embeddings, OCR, translation, and reranking—allowing every learner to rapidly build their own AI-powered learning infrastructure.

1. Install DeepStudent

DeepStudent is built with Tauri 2 and supports mainstream platforms including Windows, macOS, and Android. Simply download and install the package to get started.

2. Using SiliconFlow Models in DeepStudent

2.1 Obtain a SiliconFlow API Key

  1. Log in to SiliconFlow (registration is completed automatically on first login).
  2. Navigate to the “API Keys” page in the console.
  3. Click “Create New Key” and assign it a name (e.g., DeepStudent).
  4. Copy the generated key and store it securely.

2.2 Configure in DeepStudent

  1. Open DeepStudent and click the bottom-left corner to enter the Settings Center.
  2. Locate the dedicated SiliconFlow configuration section. As the preferred provider, SiliconFlow offers an independent quick configuration interface.
  1. Enter your SiliconFlow API Key.
  2. Click the “One-Click Allocation” button. DeepStudent will automatically create and assign all required models:
    • Main chat model (DeepSeek-V3.2)
    • Anki card generation model (Qwen3-30B-A3B)
    • Embedding model (bge-m3)
    • Reranking model (bge-reranker-v2-m3)
    • Translation model (Hunyuan-MT-7B)
    • Lightweight task model (Ling-mini-2.0)
    • Five OCR engines (PaddleOCR-VL-1.5 / PaddleOCR-VL / DeepSeek-OCR / GLM-4.6V / Qwen3-VL-8B)
    No need to manually add them one by one. For superior performance, we recommend manually assigning SiliconFlow’s Pro series models. You may also click “Fetch Model List” to retrieve the full set of available models from the SiliconFlow platform and add them as needed. DeepStudent automatically infers each model’s capabilities (multimodal, reasoning, embedding, tool-calling, etc.) and configures optimal adaptation parameters.

3. Getting Started

3.1 AI Intelligent Dialogue & Deep Reasoning

DeepStudent’s Chat V2 dialogue engine is purpose-built for learning scenarios. Paired with reasoning models such as DeepSeek-V3.2, it supports multimodal file drag-and-drop uploads with automatic OCR (images, PDFs, Word documents, etc.), deep reasoning (displaying the AI’s full chain of thought), multi-tab sessions, session branching, and context injection via the reference panel to attach knowledge base resources directly into conversations.

3.2 Skill System

The core driving force of DeepStudent is its Skill System. Unlike traditional AI tools that cram all instructions into a bloated system prompt, DeepStudent decomposes AI capabilities into independent skill modules. Each skill encapsulates scenario-specific instructions and toolsets, which are progressively disclosed only when activated. This reduces token consumption while maintaining professional precision. Currently, 22 skills are built in (including 5 integrated skills and 17 tool-group skills), covering core learning scenarios such as card generation, research, academic writing, knowledge mapping, question banks, memory enhancement, translation, tutoring, literature review, Office suite operations, and sub-agents. Users can also extend capabilities via custom SKILL.md files. The default “Deep Scholar” strategy proactively recalls user profiles and prioritizes retrieval from the local knowledge base, ensuring every conversation benefits from personalized context. The skill system grants AI flexible read/write capabilities over learning data.

3.3 Learning Resource Center

DeepStudent includes a macOS Finder-like Learning Hub that centrally manages all learning assets—notes, PDFs, question sets, translations, essays, knowledge maps, images, and more. Supported import formats include PDF, DOCX, XLSX, PPTX, EPUB, RTF, Markdown, CSV, HTML, JSON, and common image formats such as JPG/PNG/HEIC. After import, resources automatically enter the vectorization pipeline: text is recognized via OCR models (PaddleOCR-VL / DeepSeek-OCR, etc.), then converted into vector embeddings using the bge-m3 model and stored in LanceDB. The vector index supports multi-dimensional management—users can create different embedding spaces (text / multimodal) and bind them to different embedding models to meet diverse retrieval needs.

3.3 Deep Research

After activating the “Deep Research” skill, AI executes a complete multi-step agent workflow: Define objective → Online search (supports 7 search engines, configuration required) → Local knowledge base retrieval → Organization & analysis → Structured report generation. The final report is automatically saved as a note. Ideal for research projects, thesis proposals, and academic literature reviews.

3.4 ChatAnki Intelligent Card Generation

Trigger the card-generation skill in natural language during conversation (e.g., “Turn this document into flashcards”). AI will batch-generate high-quality memory cards. A built-in template designer supports visual editing. Generated cards can be previewed with 3D flip animations, and once confirmed, synced to Anki with one click—bridging the final step from “understanding” to “memory.”

3.5 More Learning Scenarios

  • Knowledge Mapping: Generate a complete subject knowledge system with a single sentence in chat. Supports multi-round AI editing, outline/mind-map view switching, and memorization mode.
  • Question Bank Practice: Upload textbooks or exam papers. AI automatically extracts question sets and supports daily practice, mock exams, and in-depth AI explanations.
  • Translation Workspace: Full-text translation with paragraph-by-paragraph bilingual alignment. Built-in presets for academic, technical, and literary domains.
  • Essay Grading: Covers Gaokao, IELTS, TOEFL, CET-4/6, and other standards. Provides multi-dimensional scoring and sentence-level refinement suggestions.
  • Intelligent Memory: Inspired by mem0 and memU, this self-evolving user profiling system automatically extracts facts from conversations—becoming more personalized over time.
  • MCP Extension: Compatible with Model Context Protocol, enabling connection to external tool services such as Context7.
  • Academic Papers: Retrieve papers via arXiv and OpenAlex. Supports batch PDF downloads, multi-source fallback, and citation formatting in BibTeX / GB/T 7714 / APA styles.
For every learner pursuing efficiency, a single API Key is all it takes to activate a complete AI-powered learning pipeline—from knowledge input to memory internalization.