Skip to main content
 Statistical & Data Sciences

Statistical & Data Sciences

The Statistical & Data Sciences (SDS) Program links faculty and students from across the college interested in learning things from data. At Smith, students learn statistics by doing—class time emphasizes problem-solving and hands-on contact with data. Many courses employ student-driven projects that allow students to pursue their interest in fields such as economics, psychology, political science, sociology, engineering, biology, environmental science, neuroscience and geology.

Upcoming Events, Talks & Lectures

Upcoming Talks & Lectures

The Statistical & Data Sciences Program hosts regular talks & lectures that are free and open to the public. No prior exposure to statistics is presumed. Stay tuned to our events page for exciting presentations coming up!

Intermittent Events

Please see the Western Mass Statistics and Data Science Meetup for additional events.

Requirements & Courses

Goals for Majors in Statistical and Data Sciences

  • Identify and work with a wide variety of data types (including, but not limited to, categorical, numerical, text, spatial and temporal) and formats (e.g. CSV, XML, JSON, relational databases, audio, video, etc.). 
  • Extract meaningful information from data sets that have a variety of sizes and formats.
  • Fit and interpret statistical models, including but not limited to linear regression models. Use models to make predictions, and evaluate the efficacy of those models and the accuracy of those predictions.
  • Understand the strengths and limits of different research methods for the collection, analysis and interpretation of data. Be able to design studies for various purposes.
  • Attend to and explain the role of uncertainty in inferential statistical procedures.
  • Read and understand data analyses used in research reports. Contribute to the data analysis portion of a research project in at least one applied discipline.
  • Compute with data in at least one high-level programming language, as evidenced by the ability to analyze a complex data set.
  • Work in multiple languages and computational environments.
  • Convey quantitative information in written, oral and graphical forms of communication to both technical and nontechnical audiences.
  • Assess the ethical implications to society of data-based research, analyses, and technology in an informed manner. Use resources, such as professional guidelines, institutional review boards, and published research, to inform ethical responsibilities.

Statistical and Data Sciences Major

Requirements

Ten courses plus SDS 100 

  1. SDS 100
  2. Five foundations and core courses: 
    1. CSC 110 or CSC 120
    2. SDS 192
    3. MTH 211
    4. SDS 220 or SDS 201
    5. SDS 291
  3. One programming depth course: SDS 270, SDS 271CSC 210, or CSC 294
  4. One statistics depth course: SDS 290SDS 293/ CSC 293MTH 320/ SDS 320 or a topic of SDS 390
  5. One data in context course: SDS 109/ CSC 109FYS 189CSC 235/ SDS 235SDS 236 or SDS 237
  6. One application domain course. A student and their advisor should identify potential application domains of interest as early as possible, since many suitable courses will have prerequisites. Normally, this should happen during the fourth semester or at the time of major declaration, whichever comes first. The determination of whether a course satisfies the requirement will be made by the student’s major advisor. The requirement is normally satisfied by one of the following:
    1. A topic of SDS 300
    2. A research seminar (normally 300-level) or special studies of at least two credits. Normally, the domain would be outside of mathematics, statistics, and computer science.
    3. A departmental honors thesis in another major (normally not MTH or CSC).
  7. One capstone course: SDS 410
  8. Electives (as needed to fulfill the 10-course requirement): Provided that the requirements listed above are met, any of the courses listed above may be counted as electives to reach the 10-course requirement. Five College courses in statistics and computer science may be taken as electives. Additionally, any course crosslisted with SDS may be counted toward completion of the major.
Additional Guidelines 
  • With the exception of SDS 192, SDS 201, SDS 220, SDS 291, and SDS 410 (or any mandatory S/U course), students may switch any one of the remaining SDS courses from graded to S/U. If a student converts from an SDS minor to an SDS major after a core course has been taken with the S/U grading option, that will count as the student's one S/U course for the major.
  • SDS 201 may be replaced by a 4 or 5 on the AP statistics exam. Replacement by AP courses does not diminish the total number of courses required for either the major or the minor.
  • MTH 211 may be replaced by petition in exceptional circumstances.
  • Any one of ECO 220, GOV 203, PSY 201 or SOC 204 may directly substitute for SDS 220 without the need to take another course, in both the major and minor. Note that SDS 220 and ECO 220 require Calculus.
  • Note that non 100-level prerequisites for the application domain course will count as inside the major credits. 
  • Five College equivalents may substitute with permission of the program.

Mathematical Statistics Major

Information on the interdepartmental major in mathematical statistics can be found on the Mathematical Sciences page of this catalog.

Statistical and Data Sciences Minor

Requirements

Six courses plus SDS 100

  1. SDS 100
  2. Four foundation and core courses: 
    1. CSC 110 or CSC 120
    2. SDS 192
    3. SDS 220 or SDS 201
    4. SDS 291
  3. One programming depth course: SDS 270, SDS 271CSC 210, or CSC 294
  4. One data in context course: CSC 109/ SDS 109, FYS 189, CSC 235/ SDS 235, SDS 236 or SDS 237
  • Should these three requirements be fulfilled by fewer than six courses, any of the courses in SDS or CSC that count towards the major may be counted towards the minor.
  • Normally, no more than one course graded S/U will be counted toward the minor. 
  • If a student converts from an SDS minor to an SDS major after a core course has been taken with the S/U grading option, that S/U will count as the student's one S/U for the major. 

Applied Statistics Minor

The interdepartmental minor in applied statistics offers students a chance to study statistics in the context of a field of application of interest to the student. The minor is designed with enough flexibility to allow a student to choose among many possible fields of application.

Requirements

Five courses plus SDS 100

  1. SDS 100
  2. One introductory statistics course: SDS 201SDS 220PSY 201ECO 220SOC 204 or GOV 203
  3. SDS 290 and SDS 291
  4. Two application domain courses. A student and their adviser should identify potential application domains of interest as early as possible, since many suitable courses will have prerequisites. The determination of whether a course satisfies the requirement will be made by the student’s minor adviser. The requirement is normally satisfied by two of the following:
    1. A topic of SDS 300
    2. A research seminar (normally 300-level) or special studies of at least two credits. Normally, the domain would be outside of mathematics, statistics, and computer science.
    3. A departmental honors thesis in another major (normally not MTH or CSC).
  • Among the courses used to satisfy the student’s major requirement, a maximum of two courses can count towards the minor.
  • Only one introductory statistics course may count toward the minor.
  • Normally, no more than one course graded S/U will be counted towards the minor.
  • If a student converts from an SDS minor to an SDS major after a core course has been taken with the S/U grading option, that S/U will count as the student's one S/U for the major. 
  • Students who have taken AP Statistics in high school and received a 4 or 5 on the AP Statistics Examination, or who have had other equivalent preparation in statistics, are not required to repeat the introductory statistics course, but they are required to complete five courses.

Courses

SDS 100 Laboratory: Reproducible Scientific Computing with Data (1 Credit)

The practice of data science rests upon computing environments that foster responsible uses of data and reproducible scientific inquiries. This course develops students’ ability to engage in data science work using modern workflows, open-source tools and ethical practices. Students learn how to author a scientific report written in a lightweight markup language (e.g., markdown) that includes code (e.g., R), data, graphics, text and other media. Students also learn to reason about ethical practices in data science. S/U only. Concurrent registration required in any of: SDS 192, SDS 201, SDS 220, SDS 290 or SDS 291. Restrictions: Not open to students who have already completed any of: SDS 192, SDS 201, SDS 220, SDS 290 or SDS 291. Enrollment limited to 30. Students not registered for a corequisite course will be dropped without notification.

Fall, Spring

SDS 109/ CSC 109 Communicating with Data (4 Credits)

Offered as SDS 109 and CSC 109. The world is growing increasingly reliant on collecting and analyzing information to help people make decisions. Because of this, the ability to communicate effectively about data is an important component of future job prospects across nearly all disciplines. In this course, students learn the foundations of information visualization and sharpen their skills in communicating using data. This course explores concepts in decision-making, human perception, color theory and storytelling as they apply to data-driven communication. This course helps students build a strong foundation in how to talk to people about data, for both aspiring data scientists and students who want to learn new ways of presenting information. Enrollment limited to 40. {M}

Fall, Spring

SDS 192 Introduction to Data Science (4 Credits)

An introduction to data science using Python, R and SQL. Students learn how to scrape, process and clean data from the web; manipulate data in a variety of formats; contextualize variation in data; construct point and interval estimates using resampling techniques; visualize multidimensional data; design accurate, clear and appropriate data graphics; create data maps and perform basic spatial analysis; and query large relational databases. Prerequisite: concurrent registration in SDS 100 required for students who have not previously completed SDS 201, SDS 220, SDS 290 or SDS 291. {M}

Fall, Spring

SDS 201 Statistical Methods for Undergraduates (4 Credits)

(Formerly MTH 201/ PSY 201). An overview of the statistical methods needed for undergraduate research, emphasizing methods for data collection, data description and statistical inference, including an introduction to study design, confidence intervals, testing hypotheses, analysis of variance and regression analysis. Techniques for analyzing both quantitative and categorical data are discussed. Applications are emphasized and students use R for data analysis. This course satisfies the basic requirement for the psychology major. Students who have taken MTH 111 or equivalent should take SDS 220, which also satisfies the basic requirement. Prerequisite: concurrent registration in SDS 100 required for students who have not completed SDS 192, SDS 220, SDS 290 or SDS 291. Restrictions: Students do not normally earn credit for more than one course on this list: ECO 220, GOV 203, MTH 220, PSY 201, SDS 201, SDS 220 or SOC 204. Enrollment limited to 40. {M}

Fall, Spring

SDS 220 Introduction to Probability and Statistics (4 Credits)

(Formerly MTH 220/SDS 220). An application-oriented introduction to modern statistical inference: study design, descriptive statistics, random variables, probability and sampling distributions, point and interval estimates, hypothesis tests, resampling procedures, and multiple regression. A wide variety of applications from the natural and social sciences are used. This course satisfies the basic requirement for biological science, engineering, environmental science, neuroscience, and psychology. Prerequisite: MTH 111, or equivalent; SDS 100 must be taken concurrently for students who have not completed SDS 192, SDS 201, SDS 290 or SDS 291. Restrictions: Students do not normally earn credit for more than one course on this list: ECO 220, GOV 203, MTH 220, PSY 201, SDS 201, SDS 220 or SOC 204. Enrollment limited to 40. {M}

Fall, Spring

SDS 235/ CSC 235 Visual Analytics (4 Credits)

Offered as CSC 235 and SDS 235. Visual analytics techniques can help people to derive insight from massive, dynamic, ambiguous and often conflicting data. During this course, students learn the foundations of the emerging, multidisciplinary field of visual analytics and apply these techniques toward a focused research problem in a domain of personal interest. Students who elect to take this course as a programming intensive course should have previously taken CSC 212. In this track, students learn to use R, Python and HTML5/JavaScript to develop custom visual analytic tools. Students preferring a non-programming intensive track may elect to use existing visual analytic software, such as Tableau or Plotly. Designations: Theory, Programming. Prerequisite: CSC 120 or equivalent. {M}

Fall, Spring, Variable

SDS 236 Data Journalism (4 Credits)

Data journalism is the practice of telling stories with data. This course will focus on journalistic practices, interviewing data as a source, and interpreting results in context. We will discuss the importance of audience in a journalistic context, and will focus on statistical ideas of variation and bias. The course will include hands-on work with data, using appropriate computational tools such as R, Python, and data APIs. In addition, we will explore the use of visualization and storytelling tools such as Tableau, plot.ly, and D3. No prior experience with programming or journalism is required. Prerequisites: An introductory statistics course (including SDS 220, SOC 204, GOV 203, ECO 220, PSY 201). Enrollment limited to 20. WI {M}

Fall, Spring, Variable

SDS 237 Data Ethnography (4 Credits)

This course introduces the theory and practice of data ethnography, demonstrating how qualitative data collection and analysis can be used to study data settings and artifacts. Students will learn techniques in field-note writing, participant observation, in-depth interviewing, documentary analysis and archival research and how they may be used to contextualize the cultural underpinnings of datasets. Students will learn how to visualize datasets in ways that foreground their sociopolitical provenance in R. Students will also learn how ethnographic methods can be leveraged to improve data documentation and communication. The course will introduce debates regarding the politics of technoscientific fieldwork. Recommended prerequisite: SDS 192. Enrollment limited to 40. {S}

Fall, Spring

SDS 238 Community-Based Data Science (4 Credits)

This course introduces concepts in human-centered design and design justice, considering how their principles can be applied in the context of community-based data science work. Students learn how to define social problems, engage stakeholders, design data science solutions, and evaluate social impact. Students also learn techniques in collaborative data science project planning and execution, engaging best practices (e.g. version control and code review) in the context of a community-based data science project. Strategies for effectively communicating project approach, outcomes, and impact are addressed throughout the course. Enrollment limited to 24.

Fall, Spring, Alternate Years

SDS 270 Programming for Data Science in R (4 Credits)

This course is not about data analysis—rather, students learn the R programming language at a deep level. Topics may include data structures, control flow, regular expressions, functions, environments, functional programming, object-oriented programming, debugging, testing, version control, documentation, literate programming, code review and package development. The major goal for the course is to contribute to a viable, collaborative, open-source, publishable R package. Prerequisites: SDS 192 and CSC 110, or equivalent. Enrollment limited to 40. {M}

Fall, Spring

SDS 271 Programming for Data Science in Python (4 Credits)

This course covers the skills and tools needed to process, analyze and visualize data in Python and work on collaborative projects. Topics include functional and object oriented programming in Python, data wrangling in Pandas, visualization in Matplotlib in seaborn, as well as creating a reproducible workflow: debugging, testing and documenting programs, and effectively using version control. The major goal for the course is to create a viable, open-source Python package like those in the Python Package Index (PyPI). Prerequisites: SDS 192 and CSC 110. Enrollment limited to 40. (E) {M}

Fall, Spring, Variable

SDS 290 Research Design and Analysis (4 Credits)

(Formerly MTH/SDS 290). A survey of statistical methods needed for scientific research, including planning data collection and data analyses that provide evidence about a research hypothesis. The course can include coverage of analyses of variance, interactions, contrasts, multiple comparisons, multiple regression, factor analysis, causal inference for observational and randomized studies and graphical methods for displaying data. Special attention is given to analysis of data from student projects such as theses and special studies. Statistical software is used for data analysis. Prerequisites: One of the following: PSY 201, SDS 201, GOV 203, ECO 220, SDS 220 or a score of 4 or 5 on the AP Statistics examination or the equivalent; concurrent registration in SDS 100 required for students who have not completed SDS 192, SDS 201, SDS 220 or SDS 291. Enrollment limited to 40. {M}

Fall, Spring

SDS 291 Multiple Regression (4 Credits)

(Formerly MTH 291/ SDS 291). Theory and applications of regression techniques: linear and nonlinear multiple regression models, residual and influence analysis, correlation, covariance analysis, indicator variables and time series analysis. This course includes methods for choosing, fitting, evaluating and comparing statistical models and analyzes data sets taken from the natural, physical and social sciences. Prerequisite: SDS 201, PSY 201, GOV 203, SDS 220, ECO 220 or equivalent or a score of 4 or 5 on the AP Statistics examination; concurrent registration in SDS 100 required for students who have not completed SDS 192, 201, 220 or 290. Enrollment limited to 40. {M}{N}

Fall, Spring

SDS 293/ CSC 293 Machine Learning (4 Credits)

Offered as CSC 293 and SDS 293. The field of statistical learning encompasses a variety of computational tools for modeling and understanding complex data. In this introductory course, we will explore many of the most popular of these tools, such as sparse regression, classification trees, boosting and support vector machines. In addition to unpacking the mathematics underlying the computational methods, students will also gain hands-on experience in applying these techniques to real datasets using R. Prerequisite: SDS 201, SDS 220 or CSC 210, or equivalent intro statistics course. Enrollment limited to 60. {M}

Fall, Spring, Annually

SDS 300di Seminar: Topics in the Applications of Statistics and Data Science-Disability Inclusion and Data Analytics (4 Credits)

Students learn the social model of disability and critical disability theory as well as research design and process, and work on a research project analyzing disability inclusion public data. The statistical methods covered in this course may include logistic regression, multivariate analysis, factor analysis, etc. Students are expected to submit their final projects to a journal, conference or competition by the end of the semester. Prerequisite: SDS 201, SDS 220 or ECO 220. Restrictions: Juniors and seniors only. Enrollment limited to 15. Instructor permission required. {M}

Fall, Spring, Variable

SDS 300fs Seminar: Topics in the Applications of Statistics and Data Science-Understanding Food Systems through Engaging the Data (4 Credits)

This course examines the global Food System, with a focus on the US, through the examination of established data used to study the system and recommend food policy. In the United States, the US Department of Agriculture (USDA) oversees much of the food system, both promoting food products and regulating health impacts. In general, the operation of these systems generates a vast amount of data, much of which is in large open or semi-open online databases. Researchers and policy makers draw on these databases to aid in their decision making. This course aims to familiarize students with the data and its uses. Prerequisite: SDS 201 or SDS 220. Restrictions: Juniors and seniors only. Enrollment limited to 15. Instructor permission required. {M}{S}

Fall, Spring, Alternate Years

SDS 320/ MTH 320 Mathematical Statistics (4 Credits)

Offered as MTH 320 and SDS 320. An introduction to the mathematical theory of statistics and to the application of that theory to the real world. Discussions include functions of random variables, estimation, likelihood and Bayesian methods, hypothesis testing and linear models. Prerequisites: a course in introductory statistics, MTH 212 and MTH 246, or equivalent. Enrollment limited to 20. {M}

Spring

SDS 338/ GOV 338 Research Seminar in Political Networks (4 Credits)

Offered as GOV 338 and SDS 338. How does the behavior of a state, politician or interest group affect the behavior of others? Does Massachusetts’s decision to legalize recreational marijuana influence Vermont’s marijuana policies? From declarations of war to the decision of who congress members' voting alignments, social scientists are increasingly looking to political networks to recognize the inter-connectedness of the world. This course presents the essentials of social network analysis and how they can be applied to American politics. Prerequisites: SDS 220 or an equivalent introductory statistics course. Restrictions: Juniors and seniors only. Enrollment limited to 12. Instructor permission required. {S}

Fall, Spring, Alternate Years

SDS 364/ PSY 364 Research Seminar: Intergroup Relationships (4 Credits)

Offered as PSY 364 and SDS 364. Research on intergroup relationships and an exploration of theoretical and statistical models used to study mixed interpersonal interactions. Example research projects include examining the consequences of sexual objectification for both women and men, empathetic accuracy in interracial interactions and gender inequality in household labor. A variety of skills including, but not limited to, literature review, research design, data collection, measurement evaluation, advanced data analysis and scientific writing are developed. Prerequisites: PSY 201, SDS 201, SDS 220 or equivalent; and PSY 202. Restrictions: Juniors and seniors only. Enrollment limited to 12. Instructor permission required. {M}{N}{S}

Fall, Spring, Alternate Years

SDS 390be Topics in Statistical and Data Sciences-Methods in Biostatistics and Epidemiology (4 Credits)

Epidemiology concerns the distribution and determinants of disease in human populations, while biostatistics focuses on the development and application of statistical methods to a wide range of topics in biology, medicine and public health. This course focuses on foundational concepts in epidemiology, including measures of association and common epidemiological study designs, and statistical methods for public health data. Discussions include categorical data analysis (contingency table analysis, multinomial regression, ordinal regression and Poisson regression) and survival analysis (Kaplan-Meier estimators and Cox proportional hazards models). No background in biology is expected or required. Prerequisites: SDS 291 and [MTH 112 or (MTH 111 and MTH 153)]. Restrictions: SDS 390 may be taken a total of 3 times with different topics. Enrollment limited to 30. {M}

Fall, Spring, Variable

SDS 390cd Topics in Statistical and Data Sciences-Categorical Data Analysis (4 Credits)

Theory and applications of statistical methods for the analysis of categorical data. The course includes an overview of statistical methods for analyzing discrete data including binary, multinomial and count response variables. Nominal and ordinal responses are considered. Discussions may include contingency table and chi-squared analyses, logistic, Poisson and negative-binomial regression models. R statistical software is used. Prerequisites: SDS 291 or SDS 290 or equivalent. Restrictions: SDS 390 may be taken a total of 3 times with different topics. Enrollment limited to 30.

Fall, Spring, Variable

SDS 400 Special Studies (1-4 Credits)

Normally for juniors and seniors. Instructor permission required.

Fall, Spring

SDS 410 Capstone in Statistical & Data Sciences (4 Credits)

This one-semester course leverages students’ previous coursework to address a real-world data analysis problem. Students collaborate in teams on projects sponsored by academia, government or industry. Professional skills developed include: ethics, project management, collaborative software development, documentation and consulting. Regular team meetings, weekly progress reports, interim and final reports, and multiple presentations are required. Open only to Statistical and Data Science majors. Prerequisites: SDS 192, SDS 291 and CSC 111. Restrictions: Statistical and Data Science majors only. Enrollment limited to 20. Instructor permission required. {M}

Fall, Spring

SDS 430D Honors Thesis (4 Credits)

Department permission required.

Fall, Spring

Crosslisted Courses 

CSC 109/ SDS 109 Communicating with Data (4 Credits)

Offered as SDS 109 and CSC 109. The world is growing increasingly reliant on collecting and analyzing information to help people make decisions. Because of this, the ability to communicate effectively about data is an important component of future job prospects across nearly all disciplines. In this course, students learn the foundations of information visualization and sharpen their skills in communicating using data. This course explores concepts in decision-making, human perception, color theory and storytelling as they apply to data-driven communication. This course helps students build a strong foundation in how to talk to people about data, for both aspiring data scientists and students who want to learn new ways of presenting information. Enrollment limited to 40. {M}

Fall, Spring

CSC 235/ SDS 235 Visual Analytics (4 Credits)

Offered as CSC 235 and SDS 235. Visual analytics techniques can help people to derive insight from massive, dynamic, ambiguous and often conflicting data. During this course, students learn the foundations of the emerging, multidisciplinary field of visual analytics and apply these techniques toward a focused research problem in a domain of personal interest. Students who elect to take this course as a programming intensive course should have previously taken CSC 212. In this track, students learn to use R, Python and HTML5/JavaScript to develop custom visual analytic tools. Students preferring a non-programming intensive track may elect to use existing visual analytic software, such as Tableau or Plotly. Designations: Theory, Programming. Prerequisite: CSC 120 or equivalent. {M}

Fall, Spring, Variable

CSC 252 Algorithms (4 Credits)

Covers algorithm design techniques ("divide-and-conquer," dynamic programming, "greedy" algorithms, etc.), analysis techniques (including big-O notation, recurrence relations), useful data structures (including heaps, search trees, adjacency lists), efficient algorithms for a variety of problems and NP-completeness. Designation: Theory. Prerequisites: CSC 210, MTH 111 and MTH 153. Enrollment limited to 30. {M}

Fall, Spring, Alternate Years

CSC 294 Computational Machine Learning (4 Credits)

An introduction to machine learning from a programming perspective. Students develop an understanding of the basic machine learning concepts (including underfitting/overfitting, measures of model complexity, training/test set splitting and cross validation), but with an explicit focus on machine learning systems design (including evaluating algorithmic complexity and development of programming architecture) and on machine learning at scale. Principles of supervised and unsupervised learning are demonstrated via an array of machine learning methods including decision trees, k-nearest neighbors, ensemble methods and neural-networks/deep-learning, as well as dimension reduction, clustering and recommender systems. Students implement classic machine learning techniques, including gradient descent. Designations: Theory, Programming. Prerequisites: CSC 210, CSC 250, (MTH 112 or MTH 211), and knowledge of Python. Enrollment limited to 40. {M}

Fall, Spring, Annually

CSC 325 Seminar: Responsible Computing (4 Credits)

When is disruption good? Who is responsible for ensuring that an innovation has a positive impact? Are these impacts shared equitably? How can bias be eliminated from algorithms, if they exist? What assurances can anyone make about the technology they develop? What are the limitations of professional ethics? This seminar examines the ethical implication (i.e., ethics, justice, political philosophy) of computing and automation. Participants explore how to design technology responsibly while contributing to progress and growth. Discussions include: intellectual property; privacy, security and freedom of information; automation; globalization; access to technology; artificial intelligence; mass society; and emerging issues. Prerequisite: CSC 210. Restrictions: Juniors and seniors only. Enrollment limited to 12. Instructor permission required. {S}

Fall, Spring, Variable

ECO 220 Introduction to Statistics and Econometrics (5 Credits)

Summarizing, interpreting and analyzing empirical data. Attention to descriptive statistics and statistical inference. Topics include elementary sampling, probability, sampling distributions, estimation, hypothesis testing and regression. Assignments include use of statistical software to analyze labor market and other economic data. Prerequisite: ECO 150 or ECO 153. Restrictions: Students do not normally earn credit for more than one course on this list: ECO 220, GOV 203, MTH 220, PSY 201, SDS 201, SDS 220 or SOC 204. Enrollment limited to 55. {M}{S}

Fall, Spring, Annually

ECO 240 Econometrics (4 Credits)

This course offers an introduction to the basic principles of econometrics and the methods used to present and analyze economic data. Knowledge of statistical methods is essential for understanding and evaluating critically much of what is written about economics and social policy. The main goal of the course is for you to leave it as an informed and critical consumer of empirical studies and with the foundational skills to conduct your own original empirical research. Prerequisites: ECO 150, ECO 153, MTH 111 and either ECO 220, SDS 220 or SDS 291. {M}{S}

Fall, Spring, Annually

FYS 189 Data and Social Justice (4 Credits)

Students examine sociopolitical forces that impact the availability, structure and governance of data regarding various social justice issues. Students learn techniques for presenting data in ways that foreground the contexts of data production and remain accountable to diverse communities. Datasets about health equity, housing justice, environmental justice and carceral justice are studied, analyzed and visualized. Students identify institutions and stakeholders involved in data production, unpack the vested interests animating data semantics, consider what people and problems get erased in data structuring and evaluate ethical tradeoffs that data scientists grapple with as they plan for data presentation. Restrictions: First years only; students are limited to one first-year seminar. Enrollment limited to 16. WI {S}

Fall, Spring, Variable

GOV 203 Empirical Methods in Political Science (5 Credits)

The fundamental problems in summarizing, interpreting and analyzing empirical data. Discussions include research design and measurement, descriptive statistics, sampling, significance tests, correlation and regression. Special attention is paid to survey data and to data analysis using computer software. Restrictions: Students do not normally earn credit for more than one course on this list: ECO 220, GOV 203, MTH 220, PSY 201, SDS 201, SDS 220 or SOC 204. Enrollment limited to 75. {M}{S}

Spring

GOV 338/ SDS 338 Research Seminar in Political Networks (4 Credits)

Offered as GOV 338 and SDS 338. How does the behavior of a state, politician or interest group affect the behavior of others? Does Massachusetts’s decision to legalize recreational marijuana influence Vermont’s marijuana policies? From declarations of war to the decision of who congress members' voting alignments, social scientists are increasingly looking to political networks to recognize the inter-connectedness of the world. This course presents the essentials of social network analysis and how they can be applied to American politics. Prerequisites: SDS 220 or an equivalent introductory statistics course. Restrictions: Juniors and seniors only. Enrollment limited to 12. Instructor permission required. {S}

Fall, Spring, Alternate Years

MTH 246 Probability (4 Credits)

An introduction to probability, including combinatorial probability, random variables, discrete and continuous distributions. Prerequisites: MTH 153 and MTH 212 (may be taken concurrently), or equivalent. {M}

Fall

MTH 320/ SDS 320 Mathematical Statistics (4 Credits)

Offered as MTH 320 and SDS 320. An introduction to the mathematical theory of statistics and to the application of that theory to the real world. Discussions include functions of random variables, estimation, likelihood and Bayesian methods, hypothesis testing and linear models. Prerequisites: a course in introductory statistics, MTH 212 and MTH 246, or equivalent. Enrollment limited to 20. {M}

Spring

MTH 354 Mathematics of Deep Learning (4 Credits)

The developments of Artificial Intelligence (AI) are tied to an unprecedented reshaping of the human experience throughout society, impacting the arts, literature, science, politics, commerce, law, education, etc. Despite these consequential effects, understanding of AI is mostly empirical. The state of knowledge of deep learning has been recently likened to a pseudo-science like alchemy. Progress in this direction rests on truly interdisciplinary approaches that are equally informed from mathematics, computer science, statistics and data science. The course goals are: (1) Understand the mathematical foundations of deep learning, (2) Develop proficiency in using mathematical tools to analyze deep learning algorithms, (3) Apply mathematical concepts to implement real-world applications of deep learning. Not recommended for first-years. Prerequisites: MTH 211 and MTH 212. Enrollment limited to 12. {M}

Fall, Spring, Variable

PSY 201 Statistical Methods for Undergraduate Research (5 Credits)

An overview of the statistical methods needed for undergraduate research emphasizing methods for data collection, data description and statistical inference including an introduction to study design, confidence intervals, testing hypotheses, analysis of variance and regression analysis. Techniques for analyzing both quantitative and categorical data are discussed. Applications are emphasized, and students use R and other statistical software for data analysis. This course satisfies the basis requirement for the psychology major. Students who have taken MTH 111 or the equivalent or who have taken AP STAT should take SDS 220, which also satisfies the major requirement. Restrictions: Students do not normally earn credit for more than one course on this list: ECO 220, GOV 203, MTH 220, PSY 201, SDS 201, SDS 220 or SOC 204. Enrollment limited to 40. {M}

Fall, Spring, Variable

PSY 364/ SDS 364 Research Seminar: Intergroup Relationships (4 Credits)

Offered as PSY 364 and SDS 364. Research on intergroup relationships and an exploration of theoretical and statistical models used to study mixed interpersonal interactions. Example research projects include examining the consequences of sexual objectification for both women and men, empathetic accuracy in interracial interactions and gender inequality in household labor. A variety of skills including, but not limited to, literature review, research design, data collection, measurement evaluation, advanced data analysis and scientific writing are developed. Prerequisites: PSY 201, SDS 201, SDS 220 or equivalent; and PSY 202. Restrictions: Juniors and seniors only. Enrollment limited to 12. Instructor permission required. {M}{N}{S}

Fall, Spring, Alternate Years

SOC 204 Statistics and Quantitative Research Methods for Sociology (5 Credits)

This project-based course covers the study of statistics for the analysis of sociological data and the study of methods for quantitative sociological research more generally. Topics in statistics include descriptive statistics, probability theory, correlation, deduction and induction, error and bias, confidence intervals and simple linear regression. Topics in research methods include positivism, research design, measurement, sampling methods and survey design. All students participate in a lab which emphasizes the use of computer software to analyze real data. Students design and complete a survey research project over the course of the semester. Prerequisite: SOC 101. Restrictions: Students do not normally earn credit for more than one course on this list: ECO 220, GOV 203, MTH 220, PSY 201, SDS 201, SDS 220 or SOC 204. Enrollment limited to 40. {M}{S}

Fall

Additional Programmatic Information

It is possible for a Smith student to obtain a master of science in statistics from the University of Massachusetts Amherst in five years (four years at Smith plus one at UMass), through the Fifth Year MS in Statistics Program. Interested students should consult with the director of the program.

Students interested in pursuing graduate work in statistics or data science should consult with their major adviser to plan an appropriate course of study. In either case, a solid foundation in mathematics (calculus I, II, and III, as well as linear algebra) is essential.

Graduate Programs in Statistics

The ASA maintains several lists of graduate programs in statistics that may help you find options that suit your needs.

Graduate Programs in Data Science

As a newer discipline, programs in data science are still in their infancy. The ASA maintains a list of graduate programs in “Big Data”, although this should not be conflated with data science. A more comprehensive list of data science degree programs is maintained by datascience.community.

Advisers

Benjamin Baumer, Shiya Cao, Kaitlyn Cook, Randi Garcia, Albert Y. Kim, Katherine Kinnaird, Scott LaCombe, Lindsay Poirier. 

Choosing Your Adviser

If you wish to declare an SDS major or minor and need an advisor, please fill out the SDS major/minor adviser request form.

Students interested in doctoral programs in Statistics should consider the Major in Mathematical Statistics jointly operated by SDS and MTH.

Programs we have recently sent students to that have fulfilled major requirements:

  1. University of Edinburgh 

  2. London School of Economics

  3. University of Auckland

  4. DIS Copenhagen

  5. PRESHCO (Programa de Estudios Hispánicos en Córdoba)
    Language requirement- Spanish

For MST Majors:

  1. CIMAT- Mathematics and Stats program in Guanajuato, Mexico
    Courses taught in English

  2. Budapest Semesters in Mathematics

For more information about study abroad, see the Smith study abroad page or contact the SDS study abroad advisor, Scott LaCombe.

Students who received a score of 4 or 5 on the AP Statistics Exam should take SDS 290 (Research Design and Analysis) or SDS 291 (Multiple Regression).

Additional Course Information

Please see the SDS section of the online course catalog for the most recent information.

Choosing a First Statistics Course

A student who wishes to study statistics may place themself according to the following guidelines.

A student with prior work in calculus or discrete math at college should start with Introduction to Probability & Statistics (SDS 220 or SDS 201, 5 credits). This is the recommended statistics course for biological sciences majors, and satisfies the basis requirement for engineering, environmental science, neuroscience and psychology. This is also the recommended course for a student who took AP statistics but didn't take the exam, or received a grade of 3 or below. ECO 220 is also a course at this general level.

A student with four years of high school math (but little or no calculus) should select SDS 201 or PSY 201 (Statistical Methods for Undergraduates). SDS 201 also satisfies the basis requirement for psychology. Other introductory courses at this level include GOV 203 and SOC 201.

A student with less preparation should select SDS 107 (Statistical Thinking) or SDS 109 (Communicating with Data).

A student who received a score of 4 or 5 on the AP Statistics Exam should take SDS 290 (Research Design and Analysis) or SDS 291 (Regression Analysis).

Taking Statistics Away (EGR Majors)

Please see the guidelines for Picker Engineering majors.

Courses Offered Through the Program

  • SDS 107: Statistical Thinking
  • CSC/SDS 109: Communicating with Data
  • FYS 189: Data and Social Justice
  • SDS 192: Introduction to Data Science
  • SDS 201: Statistical Methods for Undergraduates
  • SDS 220: Introduction to Probability and Statistics
  • CSC/SDS 235: Visual Analytics
  • SDS 236: Data Journalism
  • SDS 237: Data Ethnography
  • SDS 270: Programming for Data Science in R
  • SDS 271: Programming for Data Science in Python
  • SDS 290: Research Design and Analysis
  • SDS 291: Multiple Regression
  • SDS 293: Modeling for Machine Learning
  • SDS 300: Applications of Statistical & Data Sciences
  • MTH/SDS 320: Mathematical Statistics
  • GOV/SDS 338: Research Seminar in Political Networks
  • CSC/SDS 352: Parallel and Distributed Computing
  • CSC/SDS 354: Seminar: Music Information Retrieval
  • PSY/SDS 364: Research Seminar on Intergroup Relationships
  • SDS 390: Topics in Statistical & Data Sciences
  • SDS 400: Special Studies
  • SDS 430D: Honors Thesis
  • SDS 410: Capstone

Cross-Listed Courses

  • CSC 111: Introduction to Computer Science Through Programming
  • CSC/MTH 205: Modeling in the Sciences
  • MTH 211: Linear Algebra
  • MTH 246: Probability
  • AST 200: Astronomical Data Science
  • CSC 212: Programming With Data Structures
  • CSC 230: Introduction to Database Systems
  • CSC 252: Algorithms
  • CSC 256: Intelligent User Interfaces
  • CSC 290: Artificial Intelligence
  • CSC 294: Computational Machine Learning
  • CSC 330: Database Systems
  • CSC 390: Seminar on Artificial Intelligence

This page is intended to help EGR majors and their advisers identify appropriate courses at other universities that will satisfy the statistics requirement for the EGR major. It supplements the memo sent to EGR faculty on February 20, 2017. As noted in that memo, “equivalence of courses taken elsewhere [are] determined by...[a] qualified member of the SDS program.” Herein, we delineate the criteria used to determine equivalence in order to promote transparency and ensure a uniform experience for all.

The following criteria are used to verify that a course taken to satisfy the statistics requirement for the EGR major (hereafter “COURSE”) is satisfactory:

  • Rigor: COURSE must be at or above the level of rigor of SDS 220. This is the primary criteria.
  • Statistical reasoning: COURSE must include statistical topics like hypothesis testing, confidence intervals, and regression—not just probability topics like random variables, distributions and expected value.

Exception: Students who have earned a 4 or 5 on the AP statistics exam can waive these requirements. They can fulfill their statistics requirement by taking any non-introductory course in probability or statistics (e.g., MTH 246, SDS 290, SDS 291, SDS 293, etc.).

SDS faculty will use the following set of questions to guide their thinking on whether a course meets the above criteria. Normally, a replacement course would satisfy all or nearly all of these questions.

  • Does COURSE cover most or all of the topics listed in the description for SDS 220?
    • An application-oriented introduction to modern statistical inference: study design, descriptive statistics; random variables; probability and sampling distributions; point and interval estimates; hypothesis tests, resampling procedures and multiple regression.
  • Does COURSE include linear regression as a topic in the syllabus?
  • Does COURSE use a comprehensive textbook?
  • Does COURSE include any prerequisites (e.g., calculus) that indicate mathematical maturity?
  • Is COURSE for statistical practice (like SDS 220) and not just for statistical concepts (like SDS 107)?
  • Does COURSE explicitly mention the use of a statistical computing environment like R, SPSS, Stata, JMP or SAS (that is, something beyond Excel or TI calculators)?
  • Does COURSE include the word “business” in the course title or textbook? Smith College does not give credit for business classes.

EGR majors should consult this page first, and then present a syllabus (preferably electronic) to the SDS study abroad adviser. Although SDS 220 is a 5-credit course, the number of credits is not a determining factor.

List of Previously Approved Courses

For reference only, we provide a list of previously approved courses. Courses change over time and vary by instructor -- students should understand that just because a course was previously approved in the past does not guarantee that it will be approved in the future.

Previously Approved

Previously Not Approved

  • STAT 240, UMass
    • This course is considered equivalent to SDS 201, which does NOT fulfill the statistics requirement for EGR. Note also that STAT 240 is only 3 credits, whereas SDS 220 is 5 credits. 
  • ECE 214: Introduction to Probability and Random Processes, UMass
    • This course would be considered equivalent to MTH 246. Note the Calc III requirement.

Fall 2023

  • SDS 100: Reproducible Scientific Computing with Data (Casey Baumer; Will Hopper; Nic Schwab)
  • FYS 189: Data and Social Justice (Lindsay Poirier)
  • SDS 192: Introduction to Data Science (Shiya Cao; Jared Joseph; Jared Joseph)
  • SDS 201: Introductory Statistics (Will Hopper)
  • SDS 220: Introduction to Probability and Statistics (Rebecca Kurtz-Garcia; Rebecca Kurtz-Garcia; Scott LaCombe)
  • SDS 237: Data Ethnography (Lindsay Poirier)
  • MTH 246: Probability (Kaitlyn Cook)
  • SDS 270: Advanced Programming for Data Science (Jared Joseph)
  • SDS 271: Programming for Data Science in Python (Casey Berger)
  • SDS 291: Multiple Regression (Kaitlyn Cook; Will Hopper)
  • SDS 300: Disability Inclusion and Data Analytics (Shiya Cao)
  • PSY/SDS 364: Research Seminar: Intergroup Relationships (Randi Garcia)
  • SDS 390: Ecological Forecasting (Albert Y. Kim)
  • SDS 400: Special Studies
  • SDS 404: Honors Thesis
  • SDS 410: Capstone (Albert Y. Kim)

Spring 2024

  • SDS 100: Reproducible Scientific Computing with Data (Justin Baumann; Katherine Kinnaird; Nic Schwab)
  • SDS 192: Introduction to Data Science (Shiya Cao; Jared Joseph)
  • SDS 201: Statistical Methods for Undergraduates (Will Hopper)
  • SDS 220: Introduction to Probability and Statistics (Albert Y. Kim; Albert Y. Kim; Rebecca Kurtz-Garcia)
  • SDS 270: Programming for Data Science in R (Jared Joseph)
  • SDS 290: Research Design and Analysis (Randi Garcia)
  • SDS 291: Multiple Regression (Will Hopper; Kaitlyn Cook)
  • SDS 293: Modeling for Machine Learning (Katherine Kinnaird)
  • SDS 300: Data Science for Coral Reef Conservation (Justin Baumann)
  • MTH/SDS 320: Mathematical Statistics (Rebecca Kurtz-Garcia)
  • SDS 390: Methods in Biostatistics and Epidemiology (Kaitlyn Cook)
  • SDS 400: Special Studies
  • SDS 404: Honors Thesis
  • SDS 410: Capstone (Shiya Cao)
Employment of statisticians is projected to grow 27 percent from 2012 to 2022, much faster than the average for all occupations. Growth is expected to result from more widespread use of statistical analysis to make informed business, healthcare, and policy decisions.
Bureau of Labor Statistics

Faculty

Ben Baumer

Statistical & Data Sciences

Professor of Statistical & Data Sciences

Ben Baumer

Shiya Cao

Statistical & Data Sciences

MassMutual Assistant Professor of Statistical and Data Sciences

Kaitlyn Cook

Statistical & Data Sciences

Assistant Professor of Statistical & Data Sciences

Randi L. Garcia

Psychology

Associate Professor of Psychology and of Statistical & Data Sciences

Randi Garcia

Will Hopper

Statistical & Data Sciences

Lecturer in Statistical & Data Sciences

Placeholder Image

Katherine M. Kinnaird

Computer Science

Clare Boothe Luce Assistant Professor of Computer Science and of Statistical & Data Sciences

Katherine Kinnaird

Rebecca Kurtz-Garcia

Mathematics & Statistics

Assistant Professor of Mathematical Science and Statistical & Data Sciences

Rebecca Kurtz-Garcia

Scott LaCombe

Government

Assistant Professor of Government and of Statistical & Data Sciences

Headshot of Scott LaCombe

Ab Mosca

Statistical & Data Sciences

Visiting Assistant Professor in Computer Science and Statistical & Data Sciences

Lindsay Poirier

Statistical & Data Sciences

Assistant Professor of Statistics & Data Sciences

Nicholas Schwab

Statistical & Data Sciences

Lecturer in Statistical & Data Sciences

Nicholas Schwab

Yujie Wang

Statistical & Data Sciences

Lecturer in Statistical & Data Sciences

Emeriti

Katherine Halvorsen
Professor Emerita of Mathematics and Statistics

Research Associate

Nicholas Horton
Research Associate in Statistical & Data Sciences

Program Committee

Shannon Audley 
Associate Professor of Education & Child Study 

R. Jordan Crouser 
Associate Professor of Computer Science 

Glenn Ellis 
Professor of Engineering 

Howard J. Gold
Professor of Government

Suzanne Z. Gottschang
Professor of Anthropology and of East Asian Studies

Mary Harrington
Tippit Professor in the Life Sciences (Psychology)
 

Caroline Melly
Associate Professor of Anthropology

Philip K. Peake
Professor of Psychology 

Cornelia Pearsall
Professor of English Language & Literature

Marney Pratt
Laboratory Instructor in Biological Sciences

Susan Sayre
Associate Professor of Economics

Vis Taraz
Assistant Professor of Economics

Terry-Ann Craigie
Associate Professor of Economics
 

Resources

Think statistics is just about calculating things? Think again. 

The fields of statistics and data science are growing exceptionally fast. As technology continues to reshape our world, more and more data are being collected on any number of subjects. There is a growing belief among decision makers that these data can be useful. Yet the process of transforming data into actionable information is challenging.

To analyze modern streams of data, government agencies, nonprofits (NGOs) and private industries seek data analysts with technical skills (programming ability), the ability to reason quantitatively about data and uncertainty, and strong communication skills (in written, oral and visual forms). People with these skills are in high demand.

Statisticians use their deep understanding of mathematics and probability theory to reason about variation and uncertainty in data. For example, if a drug was observed to have a positive effect on patient outcomes in a clinical trial, was that effect large enough—given the sample size and assumptions about how the data was collected—to justify concluding that the drug actually worked? Statisticians build, validate and interpret models. They design experiments and collaborate with scientists of all stripes to make precise estimates of unknown quantities.

Data science is an emerging field that combines elements of mathematics, statistics and computer science to extract meaning from data. Data scientists work with large, complex, messy and live data sources. Often working on questions that are not well-defined, data scientists use their creativity and technical ability to dig deep into "Big Data." They build models, make predictions, and develop static and dynamic ways to visualize data.

While at Smith, statistics students have created innovative classroom activities, authored honored theses, developed sophisticated statistical software and contributed to the Office of Institutional Research. Graduates have found internships with the National Institutes of Standards and Techonology and the New York Mets, employment with GoogleMIT’s Lincoln Laboratory, and MassMutual’s Data Science Development Program. Student have also gone on to graduate school at UC Berkeley, The Harvard School of Public Health, Ohio State University, and the University of Massachusetts.

Awards

STATISTICAL & DATA SCIENCES RESEARCH PRIZE

The program awards an annual prize to a student majoring in SDS who performed outstanding research during her time at Smith. The faculty meets every spring to review the research projects they have supervised and select the prize recipient, who is then notified at the annual Awards Ceremony the day before Commencement.

Awardees

  • 2024: Silvia Song '24
  • 2023: Quinn Alrise White '23
  • 2019: Julianna Alvord '19
  • 2018: Wencong (Priscilla) Li ’18
  • 2017: Weijia (Vega) Zhang ’17
FIVE COLLEGE STATISTICS PRIZE

The Five College Statistics prize is awarded to one student at each of the Five Colleges, at their discretion. The award may be presented to a student satisfying any one of the following criteria (based on faculty vote from that institution):

  • outstanding independent research, thesis or capstone course project in statistics
  • outstanding service to statistics on their campus (or across campuses)
  • outstanding use of Five College statistics resources
  • outstanding non-senior pursuing further study in statistics

Awardees

  • 2024: Christy Yang '24 and Vicky Xu '24 
  • 2023: Clara Se-pian Li '23 & Aushanae Haller '23 
  • 2019: Yue Kuang ’19, Zixian Li ’19 and Jingyi Liu ’19 (outstanding senior capstone project)
  • 2018: Paige Patrick AC and Rutendo Madziwo ’19 (outstanding service to statistics)
  • 2017: Abby Doctor ’17 (outstanding senior capstone project)
  • 2016: Yiwen Zhu ’16 and Emma Beauchamp ’16 (outstanding theses)
  • 2015: Weijia (Vega) Zhang ’18 (outstanding non-senior)
DEPARTMENTAL HONORS

2019

2018

2017

Competitions

ASA FIVE COLLEGE DATAFEST

DataFest is a national undergraduate data analysis competition hosted at UMass each spring, in which students work in teams of up to five to extract insights from a complex, challenging data set over a weekend.

Recent Winners from Smith

2023

  • Best In Group (Panel B) Sexy Data Scientists: Sarah Branch '24, Yaretsy Castro '25, Camila Maldonado '24, Su Than Thar Nyi '24, and Sonia Paredes '23

2018

  • Best in Group (Panel B): A-super-NOVA
    Audrey Bertin ’21, Riley Boeth ’18, Emma Livingston ’20, Clara Rosenberg ’20, Kara VanAllen ’20

2017

  • Best in Show: DataBest
    Zainab Aqdas Rizvi ’18, Subashini Sridhar ’18J, Ji Young Yun ’18, Ji Wong Chung ’18 and Van Nguyen ’18
  • Best Statistical Interpretation: Normally Distributed
    Zixian Li ’19, Angie Dinh ’17, Abby Doctor ’17, Erina Fukuda ’18 and Raeesa Alam ’19J
  • Best Business Insight: Standard Divination
    Isabella Zhu ’20, Cas Sweeney ’19, Sarah Abowitz ’20 and Garcia Sun ’20
UNDERGRADUATE STATISTICS PROJECT COMPETITION (USCLAP)

The class project competition is for undergraduate students who conduct projects as part of an introductory or intermediate level statistics course. Most projects submitted to the USCLAP competition involve analyzing real data using existing statistical techniques. Students may choose any topic on which to conduct a study and students may use existing data or collect their own.

Recent Winners from Smith

2017

2016

UNDERGRADUATE RESEARCH PROJECT COMPETITION (USRESP)

The research project competition is for undergraduate students who conduct research projects coming from activities like summer research projects, senior capstone course research projects, independent research projects (e.g., independent studies), Honor's college research projects, or extension of class research projects. Some submissions to USRESP are applied research projects using existing statistical/analytical techniques to solve real world problems, which others are involve methodological research involving statistical applications or simulation studies evaluating different techniques. Students may collect their own data or use existing data and students can select any topic to study.

Recent Winners from Smith

2015

  • 2nd Place: Sara Stoudt ’15: Geostatistical Models for the Spatial Distribution of Uranium in the Continental United States

Mu Sigma Rho is the national statistics honor society, and the Boston Chapter of the ASA oversees nominations from Smith each spring. We also maintain a complete archive of recipients

Recent Smith College inductees include:

  • Abigail O'Meara
  • Amber Liu 
  • Audrey Kim 
  • Betty Pu 
  • Brianna Mateo 
  • Camila Maldonado Franco 
  • Chi Qiu 
  • Fish Fischer 
  • Grace Rade
  • Gracia Bareti
  • Jack Brandano 
  • Jenny Ding 
  • Julia Ting 
  • Maeve Tyler Penny
  • Michel Ruiz-Fuentes 
  • Nina Hernandez 
  • Parunjodhi Munisamy 
  • Sophia Silovsky 
  • Tint Tha Ra Wun 
  • Zihe Tian 
  • Molly Zelloe 
  • My My Tran 
  • Quinn Chamblin 
  • Nicole Tresvalles 
  • Noa Gittelman-Egan 
  • Tulip Daaboul 
  • Charavee Basnet Chettri 
  • Anaan Choudhury 
  • Beata Knecht 
  • JingYan Xu 
  • Abigail Paharsingh 
  • Hanna Lee 
  • Jeniah Franco 
  • Su Than Thar Nyi 
  • Katy Cummings

Contact Department of Statistical & Data Sciences

Wright Hall 226
Smith College
Northampton, MA 01063

Phone: 413-585-3520 Email: kdunphy@hkange.net

Kelley Dunphy 
Administrative Coordinator

Randi L. Garcia 
Chair, Program in Statistical & Data Sciences