A.T. Fomenko, Geometry and Probability,
Ink and pencil on paper, 38 x 47.5 cm, 1987
Pasha Zusmanovich

6PAS2 Probability and Statistic 2,
summer semester 2025/2026

Wednesday 09:10-12:25   G302  
Wednesday April 22: no class (rektorské volno)

APPROXIMATE SYLLABUS
  1. The subject of statistics. Types of data.
  2. Basic notions of descriptive statistics: mean, median, mode, quantiles, standard deviation.
  3. Correlation, correlation matrices.
  4. Normal distribution. Confidence interval.
  5. Various types of distributions.
  6. Null and alternative hypotheses. Hypothesis testing.
  7. Statistical models. Linear regression. Least squares.
  8. Clustering.
  9. Working with R.

A parallel Czech-English text covering some of the topics in probablity and statistics is available on the page of Probability & Statistics 1.

LITERATURE All the books are available in electronic form in multiple places.          = available in the university library

SOFTWARE:   R
Online sources:

A BRIEF SYNOPSIS

Class 1 February 11, 2026 (~1.5h)
Organizational issues. R.

Class 2 February 18, 2026 (~2h)
Discussion of homeworks 1-2. Mode. Skewness and kurtosis. Meaning and importance of the central limit theorem.
R code

Class 3 February 25, 2026 (~2h)
Discussion of homeworks 3-4. "Paradoxes" in statistics (according to T. Tao, Compactness and Contradiction, AMS 2013, Section 6.5). Weighted mean. Population weighted density. Simpson's paradox.
R code

Class 4 March 4, 2026 (~2.5h)
Discussion of homeworks 5-6. Central limit theorem (recap, according to Crawley, The R book, p.278). Why in the nonbiased estimator of the standard deviation we have \(n-1\) in the denominator, and not \(n\). Confidence interval (according to Dalgaard, pp.63-64). Tests for normality: normal scores, Q-Q plots (according to Cohen-Cohen, pp.220-223).
R code

Class 5 March 11, 2026 (~3h)
Discussion of homeworks 7-9. Correlation, correlation matrices. Iterated correlation matrices (according to Chen).

Class 6 March 18, 2026 (~2.5h)
Discussion of homeworks 10-12. Statistical models. Linear regression. Least squares. Residuals. (According to Albert and Rizzo, Chapter 7).

Class 7 March 25, 2026 (~3h)
Discussion of homeworks 10-14.

Class 8 April 1, 2026 (~3h)
Discussion of homeworks 12,14-16. Generalized additive models (after Crawley, Statistics: An Introduction Using R, pp.146-148).
R code

Class 9 April 8, 2026 (~1.5h)
Discussion of homeworks 17-18. Clustering (hierarchical, K-means, gravitational).

Class 10 April 15, 2026 (~2.5h)
Discussion of homeworks 15-20. Null and alternative hypotheses, p-values.

HOMEWORKS
Your final score for the course will be the number of points earned for doing homeworks, scaled at the 0-100 scale.



Old material from "VAPMS Selected Applications of Mathematical Statistics", summer semester 2018/2019
A brief synopsys: pdf    Homeworks


Created: Wed Sep 23 2015
Last modified: Wed Apr 15 2026 15:57:55 CEST