Wednesday, December 10, 2014

Competency 8.1

Competency 8.1: Prepare data for use in LightSIDE and use LightSIDE to extract a wide range of feature types.

For the purpose of this exercise, I created a simple data set of three types of plants: vegetables, fruits and flowers. I classified text (taken from Wikipedia) based on the three categories. It looked like this:

I loaded my input csv file into LightSIDE and extracted basic features like unigrams and bigrams first. Then I checked different basic features and extracted their feature sets.

I saved all the feature sets for building models later using alternative feature spaces. 

No comments:

Post a Comment

All materials are based on the EdX course - Data, Analytics and Learning
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.