Categories
Misc

NLP: Simple way of loading labeled unstructured text data from pandas dataframe into a tensorflow DataSet?

I can load text data using: tf.keras.utils.text_dataset_from_directory but to use that, each of my “rows” need to be in individual files in their own directory and the label is determined by the directory name.

For the life of me, I can’t figure out how to convert a pandas dataframe into a DataSet.

The dataframe would have some structure like this:

index label unstructured_text 1 1 i like ice cream 2 0 i don't like ice cream 3 1 i'm a little teapot and i'm happy 

etc.

Thank you for any hints.

submitted by /u/1-800-AVOGADRO
[visit reddit] [comments]

Leave a Reply

Your email address will not be published. Required fields are marked *