llm-dataset-converter release
Version 0.2.5 of our llm_dataset_converter library was released on December 20th.
Because we also released ldc_gitingest, which converts git repositories (local or remote) into pretrain text files, we published a new version of our llm_dataset_converter_all meta-library, now at 0.0.4. A new Docker image is available as well.