Eleni Koutsomitopoulou - Ελένη Κουτσομητοπούλου
  • Home
  • Class material
    • Course Syllabus
    • Course Calendar
    • Sample Lecture
    • Sample Quiz
    • Additional Course Syllabi >
      • Corpus Linguistics
      • Machine Learning
      • Statistical NLP
  • Blog
  • Contact

Probability vs. Statistics: a basic mis-conception

3/20/2013

0 Comments

 
With the popularity of statistical NLP methods and probabilistic models of natural language I have found the distinction below is paramount in the heads of some linguists.

Probability is a theoretical branch of mathematics dealing with the prediction of the likelihood of *future* events, and therefore is useful for the evaluation of the consequences of mathematical definitions.

Statistics involves the analysis of the frequency of *past* events, it is data-driven and therefore an applied branch of mathematics and is useful for the analysis of events based on observation of cumulative data about them.

Some people (linguists) further confuse in the above discussion the distinction between data and algorithms in modern NLP. This is a whole other topic
also related to the Norvig-Chomsky debate I mentioned in a previous blog post. In a later post I will explain some further common misconceptions. 


0 Comments



Leave a Reply.

    What is this space for

     A computational linguist's pond. Sometimes informational, some others instructional, or just a vent about the state of the field nowadays. 

    Archives

    April 2016
    March 2013
    January 2013
    December 2012
    September 2012

    Categories

    All

    RSS Feed

Powered by Create your own unique website with customizable templates.