Machine Learning Classification and predication in weka
I am very new to machine learning. Sorry if there are any mistakes in my English. I am using the weka J48 Classification for prediction in true or false. I have almost 999K training set which i used to train the model. I used the cross validation method with 3 folds to train the Model which gives me accuracy of ~84%. Now after storing the model. i tried to test it on 50k dataset. which is giving very bad results and 50% of them are mismatch. I have 11 attributes with nominal and numeric fields. I dont know why its happening. I have two questions. How can i train to perform better on test set. what could be possible issues. I am using weka api in java.
It means that your model is overfit for your 999k training set and doesn't generalize well to your 50k testing set. You should look into cross-validating with (a good portion, but not all) of your 50k dataset in addition to your 999k. You may also want to try something higher than a k=3, k-fold crossvalidation, because k=3 folds may be too "coarse". Good luck!
WEKA Classifiers Results
Should changing a model's intercept change its Precision and Recall?
GMM Fisher Vector
Can someone give me an example how to count probabilities using Complementary Naive Bayes in Mahout?
Classification results interpretation (TFlearn, Keras)
discretization for feature selection in weka
ROC result interpretation
Classification using Mallet and MaxEntropy
Measuring Error Correlation of Classifiers
caffe: Confused about regression
How to cut a dendrogram in r
Building weka classifier
Does Orange data mining software has multi-layer perceptron classification?
User Classification in RapidMiner - output should be the user based on a fed test data
Error in building mean image file(Caffe)
caffe: probability distribution for regression / expanding classification (softmax layer) to allow 3D output