Machine learning classification and prediction in Weka


I am very new to machine learning. Sorry if there are any mistakes in my English.
I am using the Weka J48 classifier to predict a true/false label. My training set has almost 999K instances. I evaluated the model with 3-fold cross-validation, which gave an accuracy of ~84%.
After saving the model, I tested it on a separate 50K dataset. The results are very bad: about 50% of the predictions are wrong. The data has 11 attributes, a mix of nominal and numeric fields.
I don't know why this is happening.
I have two questions:
How can I train the model so that it performs better on the test set?
What could the possible issues be?
I am using the Weka API in Java.
An accuracy of ~84% in cross-validation that drops to ~50% on new data suggests that your model is overfit to your 999K training set and doesn't generalize well to your 50K test set.
You should look into cross-validating with a good portion (but not all) of your 50K dataset in addition to your 999K, keeping the rest of the 50K aside for a final test.
You may also want to try k-fold cross-validation with something higher than k=3, because k=3 folds may be too "coarse": each round trains on only two thirds of the data. Good luck!
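To make the point about fold count concrete, here is a minimal sketch in plain Java of how k-fold splitting divides a dataset. It is not Weka's implementation; the class name `KFoldSketch`, the methods `makeFolds` and `trainingSize`, and the row count of 1000 are all made up for illustration.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Random;

public class KFoldSketch {

    /** Shuffles the row indices 0..n-1 and deals them into k folds of near-equal size. */
    public static List<List<Integer>> makeFolds(int n, int k, long seed) {
        if (k < 2 || k > n) {
            throw new IllegalArgumentException("need 2 <= k <= n");
        }
        List<Integer> indices = new ArrayList<>();
        for (int i = 0; i < n; i++) {
            indices.add(i);
        }
        // A fixed seed keeps the split reproducible between runs.
        Collections.shuffle(indices, new Random(seed));

        List<List<Integer>> folds = new ArrayList<>();
        for (int f = 0; f < k; f++) {
            folds.add(new ArrayList<>());
        }
        for (int i = 0; i < n; i++) {
            folds.get(i % k).add(indices.get(i));
        }
        return folds;
    }

    /** Number of rows left for training when fold `heldOut` is used for validation. */
    public static int trainingSize(List<List<Integer>> folds, int heldOut) {
        int total = 0;
        for (int f = 0; f < folds.size(); f++) {
            if (f != heldOut) {
                total += folds.get(f).size();
            }
        }
        return total;
    }

    public static void main(String[] args) {
        int n = 1000; // hypothetical row count, stands in for the real training set
        for (int k : new int[] {3, 10}) {
            List<List<Integer>> folds = makeFolds(n, k, 42L);
            System.out.printf("k=%d: train on %d rows, validate on %d rows%n",
                    k, trainingSize(folds, 0), folds.get(0).size());
        }
    }
}
```

With a larger k, each round trains on a larger share of the data, so the cross-validation estimate is based on models closer to the one trained on everything. In Weka itself you would not write this by hand: `Evaluation.crossValidateModel` takes the number of folds as an argument.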
