Experimentations Glossary
Concept | Definition |
---|---|
Alternative hypothesis | What will happen according to the hypothesis, the opposite of the null hypothesis. e.g. 'Students who sleep more perform better on tests.' |
Averages | Central values of data. e.g. mean, median, mode - measures of central tendency |
Between participants | Dividing participants into groups for each condition of the experiment. e.g. Group A receives a drug, Group B receives a placebo. |
Bimodal | If there are two modes, and they're clearly higher (peeks) than other values. Unimodal (1 mode) and multimodal (multiple modes) |
Categorical & ranking data | Non-numeric data grouped by category or order. e.g. blood type (categorical), race position (ranking). |
Confounding variables | Uncontrolled variables that affect the outcome. e.g. diet affecting weight loss in an exercise study. |
Correlational co-efficient | Strength and direction of variable relationship. e.g. Pearson's r = 0.85 indicates strong positive correlation. week13 |
Dependent variable | The measured outcome, e.g. heart rate change |
Descriptive statistics | Descriptions of the data from the sample. e.g. a graph or table of data, mean or median. |
Distribution | How data values are spread. e.g. normal, skewed, or bimodal. |
Experimental design | A research design involving manipulation of an independent variable. e.g. testing whether caffeine improves memory by assigning participants to caffeine or no caffeine groups. |
Falsification | Concept that a hypothesis must be disprovable. e.g. 'All swans are white' can be falsified by finding one black swan. |
Frequency distribution | Counts of values in intervals. e.g. histogram of age groups. |
Generalizing | Generalising from the sample to the population |
Global Samples | Diverse samples from various locations. e.g. participants from multiple countries. |
Graph types | Scatter plot, histogram, bar chart. |
Hypothesis testing | Statistical procedure to determine support for a hypothesis. e.g. testing if a new drug reduces symptoms more than a placebo. |
Hypothesis: one tailed | A hypothesis that predicts the direction of the two variables, e.g. extra cheese leads to worst sleep |
Hypothesis: two tailed | A hypothesis that is neutral about the direction or relationship of the two variables |
Independent Variable | The manipulated variable. e.g. film category watched |
Inferential statistics | Statistical tools that allow researchers to their sample to the population. |
Line of best fit | Line approximating data trend in a scatterplot. e.g. average trend in exam scores vs. study time. |
Mean | Sum of values divided by number. e.g. average test score. |
Median | Middle value in an ordered dataset. e.g. the 5th value in a list of 9 scores. |
Measures of central tendency | Mode, median, mean |
Mode | The most common result, e.g. scores 1,3, 5, 6, 6, 7 it would be 6. There would be no mode if they're all unique. |
NHST | TODO |
Normal distribution | The mean, median, and mode are equal. The data is symmetric (bell curve). The data tapers off gradually. AKA Gaussian distribution |
P-Value | Probability between 0 and 1 |
Qualitative data | Descriptive data. e.g. interviews. |
Quantitative data | Numerical data. e.g. height, temperature. |
Quasi experimental | An experimental design without random assignment. e.g. comparing student performance across two existing classes. |
Range | Difference between highest and lowest values. e.g. if scores range from 60 to 90, the range is 30. |
Sample | Selecting a subset of a population. e.g. choosing 100 students from a university. |
Sampling variation | Each time you choose 10 subjects for your sample, you might get variation in the mean of the value you want to measure, e.g. height. It might also be different from the population. Sample size is important, and inferential statistics are used to generalise to the population. |
Sampling error | Error from using a sample instead of the full population. e.g. survey results differ from actual population values. |
Sampling error: Type 1 | Rejecting a true null hypothesis. e.g. concluding a drug works when it doesn't. |
Sampling error: Type 2 | Failing to reject a false null hypothesis. e.g. missing the effect of a useful drug. |
Scale | Measurement system classifying data. e.g. nominal, ordinal, interval, ratio scales. |
Scatterplot | Graph showing a two variable relationship. e.g. height vs. weight. |
Standard deviation | Spread of data around the mean. e.g. SD = 2 means most scores lie within 2 points of the mean. |
Statistical significance | In psychology, statistical significance occurs when the p-value, calculated from a test like chi-squared, is less than 0.05 (5%). This means there's less than a 5% chance the results are due to random variation if the null hypothesis is true, so we reject it for the alternative hypothesis. |
Within participants | The same group of people are affected by different conditions of the independent variable, e.g. juror's guilty verdict before and after deliberation |
Value types
Concept | Definition |
---|---|
Nominal | Categories with no order (e.g., types of fruit, colours). |
Ordinal | Data with a clear order, the interval between each value isn't always equal (e.g., rankings, scariest horror films, GCSE, A Level, BSc, Ph.D.). |
Interval | Ordered items with equal spacing, and no true zero (e.g., temperature in Celsius, IQ, dates in a calendar). |
Ratio | Ordered, equal spacing, and has a true zero (e.g. height, money, weight, age). You can form ratios from the data e g 50kg is double 25kg. |
Statistical test types / designs
Concept | Definition |
---|---|
Chi-squared | Test for relationship between categorical variables. e.g. testing if gender is related to voting preference. |
Correlational | A type of research design that examines the relationship between variables without manipulation. e.g. measuring the relationship between hours studied and exam scores. |
Independent t-test | |
Paired samples t-test | |
Pearson correlation test | when the data are interval or ratio |
Sampling Error
The group in the study is the sample. The wider group is the population, which depends on the research question. The sample is generalised to the population.
Sampling error is the natural variability between a sample estimate and the true population value.
Let’s say the true average IQ in a country is 100.
- A small sample of 10 people might give you an average of 105 or 95 — lots of variation.
- A large sample of 1,000 people might give you 99.8 or 100.2 — closer to the true value.
- Both have sampling error, but the larger sample has less.
Formula
Standard Error (SE) = s / sqrt(n) Where: 𝑠 = sample standard deviation 𝑛 = sample size
Reducing error
Reducing sampling error: - Larger sample size - Random sampling to reduce bias - Stratified sampling (key groups being represented)