Skip to content

Latest commit

 

History

History
20 lines (13 loc) · 1.42 KB

File metadata and controls

20 lines (13 loc) · 1.42 KB

dabling

AI-assisted classification

Dabling is a research project of Statistics Netherlands.

This repo contains the code of a shiny dashboard developed within the joint international project ESSnet AIML4OS Work Package 10 that shows what can be done with the Dabling concept for classification.

The dashboard is here

The user can give in an OpenAI API Key that will be used for semantic text enrichtment in the classification task. The costs for this API key are for the user. With this demo, it will never exceed tokens, which is less then Eur.

This first version of the dashboard uses:

  • the AGNews dataset from the textdata R-package. The AGNews dataset is a collection of news articles, where each article is labeled with a topic (e.g. politics, sports, etc.).
  • The AG’s news topic classification. This dataset is constructed by choosing 4 largest classes from the original corpus. Each class contains 30,000 training samples and 1,900 testing samples. The total number of training samples is 120,000 and testing 7,600. Version 3, Updated 09/09/2015

A classification to show the concept. Additional classifications and examples will be added later.

Comments and suggestions welcome at:

  • Jeldrik Bakker: j <dot> bakker <at> cbs.nl
  • Olav ten Bosch: o <dot> tenbosch <at> cbs.nl