Data is the cornerstone of artificial intelligence. The efficiency of data acquisition, exchange, and application directly impacts advances in AI technologies and applications. Over the long history of AI, a vast number of datasets have been developed and distributed. However, these datasets are defined in widely varying formats, which incurs significant overhead in exchange, integration, and utilization -- one often needs to develop a new customized tool or script to incorporate a new dataset into a workflow.
To overcome these difficulties, we developed the Data Set Description Language (DSDL). For more details, please visit our official documentation; DSDL datasets can be downloaded from our platform, OpenDataLab.
- install dsdl:

  install by pip:

  ```shell
  pip install dsdl
  ```

  install by source code:

  ```shell
  git clone https://github.com/opendatalab/dsdl-sdk.git -b schema-dsdl
  cd dsdl-sdk
  python setup.py install
  ```
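After either install path, a quick way to confirm the package is importable is a small Python check. This is a minimal sketch; `is_installed` is a helper name introduced here for illustration, not part of the dsdl SDK:

```python
import importlib.util


def is_installed(package: str) -> bool:
    """Return True if `package` can be imported in the current environment."""
    return importlib.util.find_spec(package) is not None


if __name__ == "__main__":
    # After `pip install dsdl`, this should report True.
    print("dsdl installed:", is_installed("dsdl"))
```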
- install mmdet and pytorch: please refer to this installation document.
- train:

  - using single gpu:

    ```shell
    python tools/train.py {config_file}
    ```

  - using slurm:

    ```shell
    ./tools/slurm_train.sh {partition} {job_name} {config_file} {work_dir} {gpu_nums}
    ```
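The two launch commands above differ only in how their placeholders are filled in. As a minimal sketch, the command line can be assembled programmatically; `build_train_cmd` and its default values are hypothetical helpers introduced here, and the config path in the usage line is a placeholder, not a file guaranteed to ship with the repo:

```python
import shlex


def build_train_cmd(config_file: str, slurm: bool = False, partition: str = "gpu",
                    job_name: str = "dsdl_train", work_dir: str = "./work_dirs",
                    gpu_nums: int = 8) -> list:
    """Assemble the single-gpu or slurm train command from its placeholder args."""
    if slurm:
        return ["./tools/slurm_train.sh", partition, job_name, config_file,
                work_dir, str(gpu_nums)]
    return ["python", "tools/train.py", config_file]


if __name__ == "__main__":
    # Hypothetical config path -- substitute one of the configs listed below.
    print(shlex.join(build_train_cmd("configs/dsdl/coco.py")))
    print(shlex.join(build_train_cmd("configs/dsdl/coco.py", slurm=True)))
```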
- detection task:

  | Datasets   | Model | box AP | Config |
  | ---------- | ----- | ------ | ------ |
  | VOC07+12   | model | 80.3*  | config |
  | COCO       | model | 37.4   | config |
  | Objects365 | model | 19.8   | config |
  | OpenImages | model | 59.9*  | config |

  \*: box AP under the VOC metric and the OpenImages metric actually means AP_50.
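The footnote above distinguishes AP_50 (precision averaged at the single IoU threshold 0.5, as in the VOC and OpenImages protocols) from COCO-style box AP, which averages AP over IoU thresholds 0.50 to 0.95 in steps of 0.05. A toy illustration of that averaging, with made-up per-threshold values:

```python
# COCO-style AP averages per-IoU-threshold APs over IoU = 0.50, 0.55, ..., 0.95,
# while VOC/OpenImages report AP at the single threshold 0.50 (AP_50).
# The per-threshold AP values below are made up purely for illustration.
ap_per_iou = {round(0.50 + 0.05 * i, 2): ap
              for i, ap in enumerate([60.1, 57.8, 54.9, 51.2, 46.7,
                                      41.0, 34.2, 26.1, 16.3, 5.7])}

ap_50 = ap_per_iou[0.50]                              # VOC/OpenImages-style metric
coco_ap = sum(ap_per_iou.values()) / len(ap_per_iou)  # COCO-style metric

print(f"AP_50 = {ap_50:.1f}, COCO AP = {coco_ap:.1f}")
```

This is why the starred VOC and OpenImages numbers in the table look much higher than the COCO ones: a single lenient threshold is easier than an average over strict ones.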
- instance segmentation task:

  | Datasets | Model | box AP | mask AP | Config |
  | -------- | ----- | ------ | ------- | ------ |
  | COCO     | model | 38.1   | 34.7    | config |