Stata

Book description

Stata is one of the most popular statistical software in the world and suited for all kinds of users, from absolute beginners to experienced veterans. This book offers a clear and concise introduction to the usage and the workflow of Stata. Included topics are importing and managing datasets, cleaning and preparing data, creating and manipulating variables, producing descriptive statistics and meaningful graphs as well as central quantitative methods, like linear (OLS) and binary logistic regressions and matching. Additional information about diagnostical tests ensures that these methods yield valid and correct results that live up to academic standards. Furthermore, users are instructed how to export results that can be directly used in popular software like Microsoft Word for seminar papers and publications. Lastly, the book offers a short yet focussed introduction to scientific writing, which should guide readers through the process of writing a first quantitative seminar paper or research report. The book underlines correct usage of the software and a productive workflow which also introduces aspects like replicability and general standards for academic writing. While absolute beginners will enjoy the easy to follow point-and-click interface, more experienced users will benefit from the information about do-files and syntax which makes Stata so popular. Lastly, a wide range of user-contributed software („Ados") is introduced which further improves the general workflow and guarantees the availability of state of the art statistical methods.

Table of contents

  1. Cover
  2. Title Page
  3. Copyright
  4. Contents
  5. List of Notes
  6. 1 Introduction
    1. 1.1 Formatting
    2. 1.2 Graphic style
    3. 1.3 Version info
    4. 1.4 Online resources
    5. 1.5 Cheat sheet
  7. 2 The first steps
    1. 2.1 The graphical user interface (GUI)
    2. 2.2 Opening stata files
    3. 2.3 Importing non-Stata file formats
    4. 2.4 Entering data manually
    5. 2.5 Using preinstalled data
    6. 2.6 Saving and exporting data
    7. 2.7 The basic workflow
    8. 2.8 Do-files
    9. 2.9 Delimit and line breaks*
  8. 3 Cleaning and preparing data
    1. 3.1 Getting to know your data
    2. 3.2 Variable names and labels
    3. 3.3 Labeling values
    4. 3.4 IDs and unique identifiers
    5. 3.5 Missing values
    6. 3.6 Creating new variables
      1. 3.6.1 Special functions
    7. 3.7 The if qualifier
    8. 3.8 Changing and replacing variables
    9. 3.9 Removing observations and variables
    10. 3.10 Cleaning data systematically
    11. 3.11 Combining datasets*
      1. 3.11.1 Appending datasets
      2. 3.11.2 One-to-One Merge
      3. 3.11.3 Many-to-One Merge
      4. 3.11.4 One-to-Many Merge
      5. 3.11.5 All pairwise combinations
    12. 3.12 Reshaping data*
  9. 4 Describing data
    1. 4.1 Summarizing information
    2. 4.2 Using stored results*
    3. 4.3 Histograms
    4. 4.4 Boxplots
    5. 4.5 Simple bar charts
    6. 4.6 Scatterplots
    7. 4.7 Frequency tables
    8. 4.8 Summarizing information by categories
    9. 4.9 Editing and exporting graphs
      1. 4.9.1 Combining graphs
    10. 4.10 Correlations
    11. 4.11 Testing for normality
    12. 4.12 t-test for groups*
    13. 4.13 Weighting*
  10. 5 Introduction to causal analysis
    1. 5.1 Correlation and causation
    2. 5.2 Causal graphs
    3. 5.3 Estimating causal effects
    4. 5.4 What does “controlling” actually mean?*
  11. 6 Regression analysis
    1. 6.1 Research question
    2. 6.2 What is a regression?
    3. 6.3 Binary independent variable
    4. 6.4 Ordinal independent variable
    5. 6.5 Metric independent variable
    6. 6.6 Interaction effects*
      1. 6.6.1 The classic way
    7. 6.6.2 Marginal effects
    8. 6.6.3 Predicted values
    9. 6.6.4 Separate analyses by subgroups
    10. 6.7 Standardized regression coefficients*
  12. 7 Regression diagnostics
    1. 7.1 Exogeneity
    2. 7.2 Random sampling
    3. 7.3 Linearity in parameters
      1. 7.3.1 Solutions
    4. 7.4 Multicollinearity
    5. 7.4.1 Solutions
    6. 7.5 Heteroscedasticity
      1. 7.5.1 Solutions
    7. 7.6 Influential observations
      1. 7.6.1 Dfbetas
      2. 7.6.2 Cook’s distance
    8. 7.7 Summary
  13. 8 Logistic regression*
    1. 8.1 Introduction
    2. 8.2 Control variables
    3. 8.3 Nested Models
    4. 8.4 Diagnostics
      1. 8.4.1 Model misspecification
      2. 8.4.2 Sample size and empty cells
      3. 8.4.3 Multicollinearity
      4. 8.4.4 Influential observations
  14. 9 Matching
    1. 9.1 Simulating an experiment
    2. 9.2 Propensity score matching
    3. 9.3 Matching diagnostics
      1. 9.3.1 Common support
      2. 9.3.2 Balancing of covariates
  15. 10 Reporting results
    1. 10.1 Tables
    2. 10.2 Graphs
  16. 11 Writing a seminar paper
    1. 11.1 The basic structure
    2. 11.2 Master do-files
  17. 12 The next steps
    1. 12.1 Online sources and manuals
    2. 12.2 Books
  18. References
  19. Copyright
  20. Index

Product information

  • Title: Stata
  • Author(s): Felix Bittmann
  • Release date: February 2019
  • Publisher(s): De Gruyter Oldenbourg
  • ISBN: 9783110617191