Andrew Gurung
  • Introduction
  • Data Science
    • Natural Language Processing
      • Sentiment analysis using Twitter
    • Linear Algebra
      • Linear algebra explained in four pages
      • Vectors
        • Vector Basics
        • Vector Projection
        • Cosine Similarity
        • Vector Norms and Orthogonality
        • Linear combination and span
        • Linear independence and Basis vectors
      • Matrices
        • Matrix Arithmetic
        • Matrix Operations
        • Functions and Linear Transformations
        • Matrix types
      • Eigendecomposition, Eigenvectors and Eigenvalues
      • Principle Component Analysis (PCA)
      • Singular-Value Decomposition(SVD)
      • Linear Algebra: Deep Learning Book
    • Calculus
      • Functions, Limits, Continuity and Differentiability
      • Scalar Derivative and Partial Derivatives
      • Gradient
      • Matrix Calculus
      • Maxima and Minima using Derivatives
      • Gradient Descent and its types
    • Statistics and Probability
      • Probability Rules and Axioms
      • Types of Events
      • Frequentist vs Bayesian View
      • Random Variables
      • MLE, MAP, and Naive Bayes
      • Probability Distributions
      • P-Value and hypothesis test
    • 7 Step DS Process
      • 1: Business Requirement
      • 2: Data Acquisition
      • 3: Data Processing
        • SQL Techniques
        • Cleaning Text Data
      • 4: Data Exploration
      • 5: Modeling
      • 6: Model deployment
      • 7: Communication
    • Miscellaneous
      • LaTeX commands
  • Computer Science
    • Primer
      • Big O Notation
  • Life
    • Health
      • Minimalist Workout Routine
      • Reddit FAQ on Nootropics
      • Hiking/Biking Resources
    • Philosophy
      • Aristotle's Defense of Private Property
    • Self-improvement
      • 100 Mental Models
      • Don't break the chain
      • Cal Newport's 5 Productivity tips
      • Andrew Ng's advice on deliberate practice
      • Atomic Habits
      • Turn sound effects off in Outlook
    • Food and Travel
      • 2019 Guide to Pesticides in Produce
      • Recipe
        • Spicy Sesame Noodles
      • Travel
        • Hiking
    • Art
      • Scott Adams: 80% of the rules of good writing
      • Learn Blues Guitar
    • Tools
      • Software
        • Docker
        • Visual Studio Code
        • Terminal
        • Comparing Git Workflow
      • Life Hacks
        • DIY Deck Cleaner
  • Knowledge Vault
    • Book
      • The Almanack of Naval Ravikant
    • Media
    • Course/Training
Powered by GitBook
On this page
  • Hypothesis test
  • Null Hypothesis
  • Alternative Hypothesis
  • P-Value

Was this helpful?

  1. Data Science
  2. Statistics and Probability

P-Value and hypothesis test

PreviousProbability DistributionsNext7 Step DS Process

Last updated 6 years ago

Was this helpful?

Hypothesis test

Hypothesis test are setup to determine the validity of a statistical claim. Every hypothesis test contains two sets of opposing statements/hypothesis:

  • Null Hypothesis

  • Alternative Hypothesis

Null Hypothesis

The null hypothesis states that the population parameter is equal to the claimed value. For example, if the claim is that average time to cook pizza is 5 minutes, the notation for null hypothesis would be:

Alternative Hypothesis

You need to define an opposing statement/hypothesis, in cases where null hypothesis fails. There can be three possibilities for an alternative hypothesis.

  1. The population parameter is not equal to the claimed value

  2. The population parameter is greater than the claimed value

  3. The population parameter is less than the claimed value

P-Value

P-value is used as the cutoff point for rejecting the null hypothesis. A greater p-value means there is stronger evidence in the favor of the null hypothesis.

In a statistical hypothesis test, p-value is the level of marginal significance representing a given event's probability of occurrence.

The p-value is a number between 0 and 1 and interpreted in the following way:

  • A small p-value (typically ≤ 0.05) indicates strong evidence against the null hypothesis, so you reject the null hypothesis.

  • A large p-value (> 0.05) indicates weak evidence against the null hypothesis, so you fail to reject the null hypothesis.

  • p-values very close to the cutoff (0.05) are considered to be marginal (could go either way).

Note: Always report the p-value so your readers can draw their own conclusions.

Link:

https://www.dummies.com/education/math/statistics/what-a-p-value-tells-you-about-statistical-data/
(The population mean is also 5 minutes)