Back

Some examples from classification
Mingyan Shao, July 23,2003

The following table shows the classification result. There are four categories, correct positives(positive sample -> positive result), false negatives(negative sample -> positive result), false positives(positive sample -> negative result), and correct negatives(negative sample -> negative result).
For each category, we select a few samples, with large, medium, and borderline values respectively, except for false negative since there is only one sample in this category.
In each category, the total number of diagrams is shown, as well as the sample digaram numbers and their SVM values.

 


Positive(Real)
Negative(Real)
Positive(Classification)
Total number Diagram#, SVM_value
65
52 (large 1.79)
45 (medium 0.91)
17 (borderline 0.48)
Total Number
Diagram#, SVM_value
11
71 (large 25.53)
63 (borderline 0.17)
Negative(Classification)
Total number Diagram#, SVM_value
1 28 ( -0.928 )
Total number Diagram#, SVM_value
53
133 (large -14.71)
2 (medium -2.85)
9 (borderline -0.046)


The classification tool used is SVM Light, and the data set is 129 diagrams(/proj/bkl/myshao/download/pdf/result/).

The bar_classification_result is as follows(from Andrea Grimes):
===============================================================================================================
Andrea Grimes
Results from May 2, 2003.
Updated July 18, 2003.

Tested 129 diagrams: tried to classifiy them as either a bar graph or not. 
The training file used was train_bar3.dat in /proj/bkl/agrimes/svm_light/training_files_ICDAR03/

Each line is specified as followed:
    <line number>  <pos | neg classification>  <svm_classify value>   < correctly | incorrectly classified>

     1    NEGATIVE classification (-2.5339222) CORRECT
     2    NEGATIVE classification (-2.8582035) CORRECT
     3    NEGATIVE classification (-0.9397827) CORRECT
     4    NEGATIVE classification (-2.2272822) CORRECT
     5    NEGATIVE classification (-2.0226626) CORRECT
     6    NEGATIVE classification (-1.3552455) CORRECT
     7    NEGATIVE classification (-1.0116882) CORRECT
     8    NEGATIVE classification (-1.3493872) CORRECT
     9    NEGATIVE classification (-0.046178013) CORRECT
    10    NEGATIVE classification (-0.21638359) CORRECT
    11    NEGATIVE classification (-0.66425681) CORRECT
    12    POSITIVE classification (0.94387451) INCORRECT
    13    POSITIVE classification (0.9762527) INCORRECT
    14    POSITIVE classification (0.9762527) INCORRECT
    15    POSITIVE classification (0.48546754) CORRECT
    16    POSITIVE classification (1.0527275) INCORRECT
    17    POSITIVE classification (0.48176937) CORRECT
    18    NEGATIVE classification (-1.6046659) CORRECT
    19    NEGATIVE classification (-3.6734224) CORRECT
    20    NEGATIVE classification (-2.09574) CORRECT
    21    NEGATIVE classification (-0.60373161) CORRECT
    22    NEGATIVE classification (-2.5555703) CORRECT
    23    NEGATIVE classification (-2.1505971) CORRECT
    24    NEGATIVE classification (-0.92802306) INCORRECT
    25    POSITIVE classification (1.1674809) CORRECT
    26    POSITIVE classification (1.1662923) CORRECT
    27    POSITIVE classification (0.86965277) CORRECT
    28    NEGATIVE classification (-0.6414755) CORRECT
    29    POSITIVE classification (1.7736914) CORRECT
    30    POSITIVE classification (0.94775183) CORRECT
    31    POSITIVE classification (1.0346328) CORRECT
    32    POSITIVE classification (0.29500337) INCORRECT
    33    POSITIVE classification (0.29500337) INCORRECT
    34    POSITIVE classification (0.85018836) CORRECT
    35    POSITIVE classification (0.78629776) CORRECT
    36    POSITIVE classification (0.89213268) CORRECT
    37    POSITIVE classification (0.89468563) CORRECT
    38    POSITIVE classification (0.89452884) CORRECT
    39    POSITIVE classification (0.9167886) CORRECT
    40    POSITIVE classification (0.9167886) CORRECT
    41    POSITIVE classification (0.87256435) CORRECT
    42    POSITIVE classification (0.87935315) CORRECT
    43    POSITIVE classification (0.9364542) CORRECT
    44    POSITIVE classification (0.81855327) CORRECT
    45    POSITIVE classification (0.89468815) CORRECT
    46    POSITIVE classification (1.0418772) CORRECT
    47    POSITIVE classification (1.7914883) CORRECT
    48    NEGATIVE classification (-1.9224436) CORRECT
    49    POSITIVE classification (0.99755996) CORRECT
    50    POSITIVE classification (1.0329439) CORRECT
    51    POSITIVE classification (0.99755996) CORRECT
    52    POSITIVE classification (1.0024186) CORRECT
    53    POSITIVE classification (0.9662839) CORRECT
    54    POSITIVE classification (1.0337164) CORRECT
    55    POSITIVE classification (0.52887382) CORRECT
    56    POSITIVE classification (1.4803243) CORRECT
    57    POSITIVE classification (0.17699396) INCORRECT
    58    POSITIVE classification (1.0846582) CORRECT
    59    NEGATIVE classification (-3.7097009) CORRECT
    60    NEGATIVE classification (-2.758283) CORRECT
    61    NEGATIVE classification (-1.9449187) CORRECT
    62    POSITIVE classification (0.93641837) CORRECT
    63    NEGATIVE classification (-2.6536912) CORRECT
    64    NEGATIVE classification (-2.6911741) CORRECT
    65    POSITIVE classification (25.531721) INCORRECT
    66    POSITIVE classification (1.4934967) CORRECT
    67    POSITIVE classification (1.2206513) CORRECT
    68    POSITIVE classification (1.4040367) CORRECT
    69    POSITIVE classification (1.0909343) CORRECT
    70    POSITIVE classification (1.1333677) CORRECT
    71    POSITIVE classification (1.1781924) CORRECT
    72    POSITIVE classification (1.360962) CORRECT
    73    POSITIVE classification (1.1807019) CORRECT
    74    POSITIVE classification (1.1483117) CORRECT
    75    POSITIVE classification (1.1815131) CORRECT
    76    POSITIVE classification (1.6780329) CORRECT
    77    POSITIVE classification (1.0584048) CORRECT
    78    POSITIVE classification (1.0909343) CORRECT
    79    POSITIVE classification (1.1807019) CORRECT
    80    POSITIVE classification (1.5478655) CORRECT
    81    NEGATIVE classification (-0.17959917) CORRECT
    82    NEGATIVE classification (-0.55630045) CORRECT
    83    POSITIVE classification (0.87777433) CORRECT
    84    NEGATIVE classification (-5.9892711) CORRECT
    85    NEGATIVE classification (-5.8229108) CORRECT
    86    NEGATIVE classification (-5.822029) CORRECT
    87    NEGATIVE classification (-6.9881848) CORRECT
    88    POSITIVE classification (1.1844446) CORRECT
    89    POSITIVE classification (1.2006544) CORRECT
    90    POSITIVE classification (1.3223423) CORRECT
    91    POSITIVE classification (1.19421) CORRECT
    92    NEGATIVE classification (-2.332519) CORRECT
    93    NEGATIVE classification (-1.1690702) CORRECT
    94    NEGATIVE classification (-0.60890752) CORRECT
    95    NEGATIVE classification (-0.30305117) CORRECT
    96    NEGATIVE classification (-0.14499991) CORRECT
    97    NEGATIVE classification (-0.60925938) CORRECT
    98    NEGATIVE classification (-0.45687136) CORRECT
    99    NEGATIVE classification (-0.45687136) CORRECT
   100    NEGATIVE classification (-0.90380316) CORRECT
   101    NEGATIVE classification (-0.60870483) CORRECT
   102    NEGATIVE classification (-5.8202508) CORRECT
   103    NEGATIVE classification (-1.4823111) CORRECT
   104    POSITIVE classification (1.1040866) CORRECT
   105    POSITIVE classification (0.8380009) CORRECT
   106    POSITIVE classification (1.1834888) CORRECT
   107    POSITIVE classification (0.76543818) CORRECT
   108    POSITIVE classification (0.69934359) CORRECT
   109    POSITIVE classification (0.18928252) INCORRECT
   110    NEGATIVE classification (-2.0756659) CORRECT
   111    POSITIVE classification (1.2379374) CORRECT
   112    POSITIVE classification (0.87325124) CORRECT
   113    POSITIVE classification (0.40045535) INCORRECT
   114    NEGATIVE classification (-1.0502237) CORRECT
   115    POSITIVE classification (0.24535604) INCORRECT
   116    NEGATIVE classification (-3.0678947) CORRECT
   117    NEGATIVE classification (-3.2573632) CORRECT
   118    NEGATIVE classification (-0.96922582) CORRECT
   119    NEGATIVE classification (-1.7196749) CORRECT
   120    NEGATIVE classification (-3.7943255) CORRECT
   121    POSITIVE classification (1.1709847) CORRECT
   122    NEGATIVE classification (-3.6814464) CORRECT
   123    POSITIVE classification (1.4201249) CORRECT
   124    NEGATIVE classification (-7.3470058) CORRECT
   125    POSITIVE classification (1.4201217) CORRECT
   126    NEGATIVE classification (-14.717282) CORRECT
   127    NEGATIVE classification (-10.65083) CORRECT
   128    POSITIVE classification (1.4562024) CORRECT
   129    POSITIVE classification (1.7287306) CORRECT

  Summarized Results:   
   Total Positives: 75
   False Positives: 11
   Correct Positives: 64
   Total Negatives: 54
   False Negatives: 1
   Correct Negatives: 53
   Total (positive and negative) misclassified: 12

   Accuracy 90.7% (# correct classified [117] / total number of ds [129])
   Recall 98.5%     (# correct classified positives [64] / # total positive ds [65])
   Precision 85.3% (# correct classified positive [64] / # total classified positive [75])


INCORRECTLY CLASSIFIED DIAGRAMS
   Each line specified as follows:
    <line number (from above)> <pos | neg classification> <svm_classify value> < correctly | incorrectly classified>  <actual graph type> Real Graph Type

    12    POSITIVE classification (0.94387451) INCORRECT      line
    13    POSITIVE classification (0.9762527) INCORRECT        line
    14    POSITIVE classification (0.9762527) INCORRECT        line
    16    POSITIVE classification (1.0527275) INCORRECT        line
    24    NEGATIVE classification (-0.92802306) INCORRECT  bar
    32    POSITIVE classification (0.29500337) INCORRECT      line
    33    POSITIVE classification (0.29500337) INCORRECT       line
    57    POSITIVE classification (0.17699396) INCORRECT       gene
    65    POSITIVE classification (25.531721) INCORRECT         line
   109    POSITIVE classification (0.18928252) INCORRECT      gene
   113    POSITIVE classification (0.40045535) INCORRECT      line
   115    POSITIVE classification (0.24535604) INCORRECT      line


Mapping from Line number to Diagram Number

     1    #1
     2    #2
     3    #3
     4    #4
     5    #5
     6    #6
     7    #7
     8    #8
     9    #9
    10    #10
    11    #11
    12    #12
    13    #13
    14    #14
    15    #15
    16    #16
    17    #17
    18    #19
    19    #20
    20    #22
    21    #23
    22    #24
    23    #27
    24    #28
    25    #29
    26    #30
    27    #31
    28    #32
    29    #33
    30    #34
    31    #35
    32    #36
    33    #37
    34    #38
    35    #40
    36    #41
    37    #42
    38    #43
    39    #44
    40    #45
    41    #46
    42    #47
    43    #48
    44    #49
    45    #50
    46    #51
    47    #52
    48    #53
    49    #54
    50    #55
    51    #56
    52    #57
    53    #58
    54    #59
    55    #60
    56    #62
    57    #63
    58    #64
    59    #65
    60    #66
    61    #67
    62    #68
    63    #69
    64    #70
    65    #71
    66    #73
    67    #74
    68    #75
    69    #76
    70    #77
    71    #78
    72    #79
    73    #80
    74    #81
    75    #82
    76    #83
    77    #84
    78    #85
    79    #86
    80    #87
    81    #88
    82    #89
    83    #90
    84    #91
    85    #92
    86    #93
    87    #94
    88    #95
    89    #96
    90    #97
    91    #98
    92    #99
    93    #100
    94    #101
    95    #102
    96    #103
    97    #104
    98    #105
    99    #106
   100    #107
   101    #108
   102    #109
   103    #110
   104    #111
   105    #112
   106    #113
   107    #114
   108    #115
   109    #116
   110    #117
   111    #118
   112    #119
   113    #120
   114    #121
   115    #122
   116    #123
   117    #124
   118    #125
   119    #126
   120    #127
   121    #128
   122    #129
   123    #130
   124    #131
   125    #132
   126    #133
   127    #134
   128    #135
   129    #136