Current Position:fig. beginning " AI Answers

How to get and use the Chinese DeepSeek-R1 distillation dataset?

2025-09-05

AI Answers

1.7 K

Guidelines for accessing and using the dataset

The process of using the Chinese DeepSeek-R1 distillation dataset can be divided into the following steps:

Acquisition Methods

Access to Hugging Face or ModelScope platforms
Search for "Chinese-DeepSeek-R1-Distill-data-110k"
Select the appropriate format (e.g. JSON, CSV, etc.) to download the dataset

Loading and use

environmental preparation: Python and datasets libraries need to be installed

Basic loading::

from datasets import load_dataset
dataset = load_dataset("Congliu/Chinese-DeepSeek-R1-Distill-data-110k")

Data Viewing: Basic information can be viewed via print(dataset) and print(dataset['train'][0])

Preprocessing and training

It is recommended to use Transformer related tool libraries (e.g. Hugging Face's transformers) for data preprocessing and model training. The dataset has been normalized, but further processing may still be performed depending on the specific task requirements.

This answer comes from the articleChinese based full-blooded DeepSeek-R1 distillation dataset, supports Chinese R1 distillation SFT datasetThe

May not be reproduced without permission:AI productivity tools " How to get and use the Chinese DeepSeek-R1 distillation dataset?

How to get and use the Chinese DeepSeek-R1 distillation dataset?

Guidelines for accessing and using the dataset

Acquisition Methods

Loading and use

Preprocessing and training

Recommended

Can't find AI tools? Try here!

Popular AI tools

New Releases

Latest AI tools

How to get and use the Chinese DeepSeek-R1 distillation dataset?

Guidelines for accessing and using the dataset

Acquisition Methods

Loading and use

Preprocessing and training

Recommended

Can't find AI tools? Try here!

Popular AI tools

New Releases

Latest AI tools

Quick query station AI tool