Gaussian vs. Multinomial Naive Bayes
Naive Bayes comes in several flavours depending on the distribution assumed for the feature likelihoods — the most common are Gaussian (for continuous data) and Multinomial (for count data).
Gaussian Naive Bayes
Gaussian NB assumes each feature follows a Gaussian (normal) distribution within each class. It estimates the mean and variance of each feature per class from training data and uses the Gaussian PDF to compute likelihoods.
Gaussian NB in Practice
Multinomial Naive Bayes
Multinomial NB models features as counts (e.g., word frequencies in a document). It is the standard choice for text classification with bag-of-words or TF-IDF features.
Multinomial NB for Text
Bernoulli NB: A Third Option
Bernoulli NB is designed for binary feature vectors (word presence/absence rather than counts). It penalises the non-occurrence of a word, which can improve performance on short documents where absence is informative.