Plan

  1. Data collection
  2. Tagging and code markup
  3. Building classification models
  4. Building a knowledge graph of the Deep Learning process
  5. Proof of concept of a model that generates code from a natural-language (NL) description

Current Tasks (until 1 Sep)

The current short-term goal is to build a model that classifies a source code chunk and locates exactly where the detected class occurs within the chunk (tag segmentation).
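
To make the tag segmentation output concrete, here is a minimal sketch of one possible representation. It assumes the model predicts one tag (or no tag) per line of the chunk; the line-level granularity, the tag names and the function name segments_from_line_tags are illustrative assumptions, not the project's actual format.

```python
from itertools import groupby
from typing import List, Optional, Tuple

# (tag, first_line, last_line), with 0-based inclusive line indices.
Segment = Tuple[str, int, int]


def segments_from_line_tags(line_tags: List[Optional[str]]) -> List[Segment]:
    """Merge runs of identical per-line tags into (tag, start, end) segments.

    Lines tagged None are treated as untagged and produce no segment.
    """
    segments: List[Segment] = []
    position = 0
    for tag, run in groupby(line_tags):
        length = len(list(run))
        if tag is not None:
            segments.append((tag, position, position + length - 1))
        position += length
    return segments


# Hypothetical per-line predictions for a five-line chunk.
line_tags = ["import", "import", None, "model_train", "model_train"]
print(segments_from_line_tags(line_tags))
```

Each segment then answers both questions at once: which class was detected, and on which lines of the chunk it sits.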

Datasets description

(a backup is stored at https://yadi.sk/d/qY9lEd6-275KEw)

You can download all the data and the models with dvc pull:

  1. Clone the repo into a folder
  2. Install DVC
  3. Open a terminal and go to the folder with the repo
  4. Run dvc pull, or dvc pull data if you want only the data without the models
  5. Enjoy!
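
The pull step above can also be scripted. The following is a minimal sketch that assumes the repo is already cloned and the dvc executable is on PATH; the helper names dvc_pull_command and pull, and the path in the usage comment, are hypothetical.

```python
import subprocess
from typing import List, Optional


def dvc_pull_command(target: Optional[str] = None) -> List[str]:
    """Build the dvc pull command; pass target="data" to skip the models."""
    command = ["dvc", "pull"]
    if target is not None:
        command.append(target)
    return command


def pull(repo_dir: str, target: Optional[str] = None) -> None:
    """Run dvc pull inside repo_dir and raise if the command fails."""
    subprocess.run(dvc_pull_command(target), cwd=repo_dir, check=True)


# Example usage (the path is a placeholder for wherever the repo was cloned):
# pull("path/to/repo")          # data and models
# pull("path/to/repo", "data")  # data only
```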