程式扎記: [ ML 小學堂 ] Linear classification using the perceptron

Linear classification using the perceptron :
Perceptron 透過 Hyperplane 對已經 classified 的 instances 進行切割, 如果 training data 是 linearly separable 的話, 可以透過這個演算法使用公式:

(w0=1, x0 為常數)

對給予的 input (x0,x1...xk) 進行分類. 底下我們將實作一個最簡單版本的 Perceptron 來認識該演算法是如何運作的, 首先來看看這個演算法的 Pseudo code :

Here, a1, a2, . . ., ak are the attribute values, and w0, w1, . . ., wk are the weights that define the hyperplane. We will assume that each training instance i1, i2, . . . is extended by an additional attribute a0 that always has the value 1. This extension, which is called the bias, just means that we don’t have to include an additional constant element in the sum. If the sum is greater than zero, we will predict the first class; otherwise, we will predict the second class.We want to find values for the weights so that the training data is correctly classified by the hyperplane.

而針對上面 Pseudo code 的說明, 你可以想成每次在做 classify 時, 只要判斷錯誤, 因為 Instance 的值我們是不能動的, 所以能動的就是 weighting vector 形成的 Hyperplane. 所以我們就在每次 mis-classify 時動態調整 weighting vector 的值來移動 Hyperplane. 示意圖如下 :

Simplest implementation of Perceptron :
底下便是實作的部分, 使用的語言是 Java. 沒有太多特別的地方, 就跟 Pseudo code 說的一樣. 這裡我們定義類別 SimpleInst 來承裝 training data :

view plaincopy to clipboardprint?
package ml.supervised.perceptron;  
  
import java.util.HashMap;  
import java.util.Iterator;  
import java.util.Map.Entry;  
  
public class SimpleInst implements Inst{  
    public static int MAX_FEATURE_SIZE=-1;  
    public int cls = -1;  
    public double lastSum = -1;  
    public HashMap values = new HashMap();   
  
    public SimpleInst(int c, Double...values){  
        this.MAX_FEATURE_SIZE=values.length;  
        this.setValues(c, values);  
    }  
      
    public SimpleInst(int...values){  
        this.MAX_FEATURE_SIZE=values.length-1;  
        cls = values[0];  
        for(int i=1; ithis.values.put(i-1, (double)values[i]);  
    }  
      
    @Override  
    public String toString()  
    {  
        StringBuffer strBuf = new StringBuffer("");  
        Iterator> iter = values.entrySet().iterator();  
        strBuf.append("Inst(");  
        boolean flag = true;  
        Entry ety;  
        while(iter.hasNext())  
        {  
            ety = iter.next();  
            if(flag==true)  
            {  
                flag = false;  
                strBuf.append(ety.getValue());  
            }  
            else  
            {  
                strBuf.append(String.format(", %.01f", ety.getValue()));  
            }  
        }  
        strBuf.append(")");  
        return strBuf.toString();  
    }  
      
    @Override  
    public int classify(IWeight weight) {  
        Object sum = classifyInReal(weight);  
        if(sum!=null)  
        {             
            if((Double)sum>0) return 1; // Class1  
            else return 0; // Class2  
        }  
        return -1;  
    }  
  
    @Override  
    public int size() {  
        return MAX_FEATURE_SIZE;  
    }  
  
    public double[] values()  
    {  
        double vals[] = new double[MAX_FEATURE_SIZE];  
        for(int i=0; i
            Double d  = (Double)valueAt(i);  
            if(d!=null)  
            {  
                vals[i] = d;  
            }  
            else  
            {  
                System.out.printf("\t[Error] idx=%d has no value!\n", i);  
            }  
        }  
        return vals;  
    }  
      
    @Override  
    public Object valueAt(int idx) {  
        return values.get(idx);  
    }  
  
    @Override  
    public boolean setValues(int c, Object... inValues) {  
        if(inValues.length == MAX_FEATURE_SIZE)  
        {  
            for(int i=0; i
            this.cls = c;  
            return true;  
        }  
        return false;  
    }  
  
    @Override  
    public boolean isCC(IWeight weight) {  
        return classify(weight)==cls;  
    }  
  
    @Override  
    public int cls() {  
        return cls;  
    }  
  
    @Override  
    public Object classifyInReal(IWeight weight) {  
        if(weight.size()==this.size())  
        {  
            double sum = 0;  
            for(int i=0; i
            {                 
                sum+=((Double)valueAt(i))*weight.w(i);  
                //System.out.printf("w%d=%.0f ; x%d=%.0f -> sum=%.0f", i, weight.w(i), i, (Double)valueAt(i), sum);  
            }  
            lastSum = sum;  
            return sum;  
        }  
        else  
        {  
            System.out.printf("\t[Error] Weighting vector(%d) has different size with input data(%d)!\n", weight.size(), size());  
        }  
        return null;  
    }  
}  

在 weighting vector 的部分, 這裡定義了類別 SimpleWht :

view plaincopy to clipboardprint?
package ml.supervised.perceptron;  
  
import java.util.Comparator;  
import java.util.HashMap;  
import java.util.Iterator;  
import java.util.Map.Entry;  
import java.util.PriorityQueue;  
  
public class SimpleWht implements IWeight{  
    public static int MAX_FEATURE_SIZE=-1;  
    public HashMap weights = new HashMap();   
      
    public static class Bean implements Comparable  
    {  
        int key;  
        double value;  
        public Bean(int k, double v){key = k; value = v;}  
        @Override  
        public int compareTo(Bean o) {  
            if(key > o.key) return 1;  
            else if(key < o.key) return -1;  
            return 0;  
        }  
    }  
      
    public SimpleWht(int size){  
        //System.out.printf("\t[Test] %d\n", weights.size());  
        this.MAX_FEATURE_SIZE=size;  
        zero();  
        //setW(0, 1); // w0=1  
    }  
      
    @Override  
    public double w(int idx) {  
        return weights.get(idx);  
    }  
  
    @Override  
    public double plus(int idx, double val) {  
        return weights.put(idx, weights.get(idx)+val);  
    }  
  
    @Override  
    public double minus(int idx, double val) {  
        return weights.put(idx, weights.get(idx)-val);  
    }  
  
    @Override  
    public double setW(int idx, double val) {  
        weights.put(idx, Double.valueOf(val));  
        return 0;  
    }  
  
    @Override  
    public void zero() {  
        for(int i=0; i0.0);  
    }  
  
    @Override  
    public int size() {  
        return MAX_FEATURE_SIZE;  
    }  
  
    @Override  
    public void plus(double... vals) {  
        for(int i=0; i
    }  
  
    @Override  
    public void minus(double... vals) {  
        for(int i=0; i
    }  
      
    @Override  
    public String toString()  
    {  
        StringBuffer strBuf = new StringBuffer();  
        Iterator> iter = weights.entrySet().iterator();  
        while(iter.hasNext())  
        {  
            Entry ety = iter.next();  
            strBuf.append(String.format("w%d=%.01f ", ety.getKey(), ety.getValue()));  
        }         
        return strBuf.toString();  
    }  
}  

接著下面是上述 Pseudo code 的實現 :

view plaincopy to clipboardprint?
public static int MAX_LOOP = 20;   
  
/** 
* BD : 根據  Linear function f = a0 * w0 + a1 * w1 + a2 * w2 + a3 * w3 + a4 * w4  
*      決定  instance 屬於類別 class1(1) 或  class2(0). 如果  f>0  則為類別 class1, 否則為 class2.    
* @param args 
*/  
public static void main(String[] args) throws Exception{  
    IWeight     ws = new SimpleWht(5);  
      
    /*定義 training instance list*/    
    List instList = new LinkedList();  
    instList.add(new SimpleInst(0, 1, 2, 1, 0, -1)); // Class0: (1, 2, 1, 0, -1)  
    instList.add(new SimpleInst(0, 1, 3, 5, 0, -4));   
    instList.add(new SimpleInst(1, 1, 0, -2, 9, 4)); // Class1: (1, 0, -2, 9, 4)  
    instList.add(new SimpleInst(1, 1, 0, -5, 4, 8));  
    instList.add(new SimpleInst(1, 1, 0, 0, 1, 4));  
    instList.add(new SimpleInst(0, 1, 1, 3, 0, 0));  
      
    /*Perceptron algorithm*/  
    int cnt=0;  
    boolean isDone;  
    while(cnt
    {  
        isDone = true;  
        for(Inst ist:instList)  
        {                 
            if(!ist.isCC(ws)) // Not classified correctly  
            {  
                if(ist.cls()>0)  
                {  
                    /*Instance 為  class1(1) 但被判為 class2(0) 時, 將 instance 的 attribute 加到  weighting.*/  
                    // w.x < 0 -> class2 -> wrong  
                    System.out.printf("\t[Info] %s is misclassified as class2(%.2f)!\n", ist, ((SimpleInst)ist).lastSum);  
                    ws.plus(((SimpleInst)ist).values());  
                    isDone = false;  
                }  
                else  
                {  
                    /*Instance 為  class2(0) 但被判為 class1(1) 時, 將 instance 的 attribute 從  weighting 減掉.*/  
                    // w.x > 0 -> class1 -> wrong  
                    System.out.printf("\t[Info] %s is misclassified as class1(%.2f)!\n", ist, ((SimpleInst)ist).lastSum);  
                    //System.out.printf("\t[Info] %s is classified correctly!\n", ist);  
                    ws.minus(((SimpleInst)ist).values());  
                    isDone = false;  
                }  
                System.out.printf("\t[Info] Modify ws -> %s\n", ws);  
            }  
            else  
            {  
                System.out.printf("\t[Info] %s is classified correctly!\n", ist);  
            }  
        }  
        System.out.printf("===== Round %d done =====\n", cnt+1);  
        if(isDone) break;  
        cnt++;  
    }  
      
    /*Print classified result*/  
    System.out.printf("\t[Info] Weighting vector:\n%s\n", ws);  
      
    for(int i=0; i
    {  
        int clsInPredict = instList.get(i).classify(ws);  
        int groundTruth = instList.get(i).cls();  
        System.out.printf("\t[Info] Inst(%d) is %s as '%s' (Ground trouth is '%s').\n", i,  
                          clsInPredict==groundTruth?"classified":"misclassified",                                                                     
                          cis(clsInPredict),  
                          cis(groundTruth));  
    }  
}  
  
public static String cis(int cls)  
{  
    if(cls > 0) return "class1";  
    else return "class2";  
}  

執行結果如下, 很幸運的是我們在 Round2 就找到可以正確的 weighting vector 對所有 training data 進行 classify :

[Info] Inst(1.0, 2.0, 1.0, 0.0, -1.0) is classified correctly!
[Info] Inst(1.0, 3.0, 5.0, 0.0, -4.0) is classified correctly!
[Info] Inst(1.0, 0.0, -2.0, 9.0, 4.0) is misclassified as class2(0.00)!
[Info] Modify ws -> w0=1.0 w1=0.0 w2=-2.0 w3=9.0 w4=4.0
[Info] Inst(1.0, 0.0, -5.0, 4.0, 8.0) is classified correctly!
[Info] Inst(1.0, 0.0, 0.0, 1.0, 4.0) is classified correctly!
[Info] Inst(1.0, 1.0, 3.0, 0.0, 0.0) is classified correctly!
===== Round 1 done =====
[Info] Inst(1.0, 2.0, 1.0, 0.0, -1.0) is classified correctly!
[Info] Inst(1.0, 3.0, 5.0, 0.0, -4.0) is classified correctly!
[Info] Inst(1.0, 0.0, -2.0, 9.0, 4.0) is classified correctly!
[Info] Inst(1.0, 0.0, -5.0, 4.0, 8.0) is classified correctly!
[Info] Inst(1.0, 0.0, 0.0, 1.0, 4.0) is classified correctly!
[Info] Inst(1.0, 1.0, 3.0, 0.0, 0.0) is classified correctly!
===== Round 2 done =====
[Info] Weighting vector:
w0=1.0 w1=0.0 w2=-2.0 w3=9.0 w4=4.0
[Info] Inst(0) is classified as 'class2' (Ground trouth is 'class2').
[Info] Inst(1) is classified as 'class2' (Ground trouth is 'class2').
[Info] Inst(2) is classified as 'class1' (Ground trouth is 'class1').
[Info] Inst(3) is classified as 'class1' (Ground trouth is 'class1').
[Info] Inst(4) is classified as 'class1' (Ground trouth is 'class1').
[Info] Inst(5) is classified as 'class2' (Ground trouth is 'class2').

Supplement :
* Wiki : Winnow (algorithm)

The winnow algorithm is a technique from machine learning for learning a linear classifier from labeled examples. It is very similar to the perceptron algorithm. However, the perceptron algorithm uses an additive weight-update scheme, while Winnow uses a multiplicative scheme that allows it to perform much better when many dimensions are irrelevant (hence its name).

* [ ML In Action ] Predicting numeric values : regression - Linear regression (1)
* [ ML In Action ] Logistic Regression

程式扎記

標籤

2012年9月17日星期一

[ ML 小學堂 ] Linear classification using the perceptron

沒有留言:

張貼留言

[Git 常見問題] error: The following untracked working tree files would be overwritten by merge

檢舉濫用情形

學習筆記

標籤

2012年9月17日 星期一