Tag: statistics

  • Minkowski distance

    Minkowski distance is generalization of Euclidean distance, L1, and manhattan(or city block) distance, L2. I saw this equation before, but didn’t know its name yet, so I’m writing it here.

  • Naive bayes with expectation maximization

    http://pages.cs.wisc.edu/~jerryzhu/cs769/em.pdfBy using EM algorithm, one can run naive bayes algorithm with partially labeled data(or with unlabeled data).

  • Pseudo Sigma

    While reading books on EDA(Exploratory Data Analysis), one of the interesting things was pseudo sigma. It’s a standard deviation like measure which is resistant to noises or outliers. Simply put, given the first quartile H1 and the third quartile H3, pseudo sigma is (H3-H1)/1.35. Why? It’s because H1= μ – 0.675σ and H3 = μ + 0.675σ if X…