Academia.edu no longer supports Internet Explorer.

To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to  upgrade your browser .

Enter the email address you signed up with and we'll email you a reset link.

  • We're Hiring!
  • Help Center

paper cover thumbnail

Simple Linear Regression

Profile image of linda herkenhoff

Coefficient of determination, Method of least squares, Market model, Regression, Residual, Residual plots, Slope, Standardized residuals, y- intercept, \( \bar{y} \), y i , and \( \hat{y} \).

Related Papers

shashank srivastava

simple linear regression research paper pdf

MUSTAPHA NASIR USMAN

Regression models form the core of the discipline of econometrics. Although econometricians routinely estimate a wide variety of statistical models, using many different types of data, the vast majority of these are either regression models or close relatives of them. In this chapter, we introduce the concept of a regression model, discuss several varieties of them, and introduce the estimation method that is most commonly used with regression models, namely, least squares. This estimation method is derived by using the method of moments, which is a very general principle of estimation that has many applications in econometrics. The most elementary type of regression model is the simple linear regression model, which can be expressed by the following equation: y t = β 1 + β 2 X t + u t. (1.01) The subscript t is used to index the observations of a sample. The total number of observations, also called the sample size, will be denoted by n. Thus, for a sample of size n, the subscript t runs from 1 to n. Each observation comprises an observation on a dependent variable, written as y t for observation t, and an observation on a single explanatory variable, or independent variable, written as X t. The relation (1.01) links the observations on the dependent and the explanatory variables for each observation in terms of two unknown parameters, β 1 and β 2 , and an unobserved error term, u t. Thus, of the five quantities that appear in (1.01), two, y t and X t , are observed, and three, β 1 , β 2 , and u t , are not. Three of them, y t , X t , and u t , are specific to observation t, while the other two, the parameters, are common to all n observations. Here is a simple example of how a regression model like (1.01) could arise in economics. Suppose that the index t is a time index, as the notation suggests. Each value of t could represent a year, for instance. Then y t could be household consumption as measured in year t, and X t could be measured disposable income of households in the same year. In that case, (1.01) would represent what in elementary macroeconomics is called a consumption function.

Fabio Vanden Broeck

Devesh Deochake , Shewale Tanmay

The following technical paper presents two case studies pertaining to Linear Regression analysis. Case study 1 presents the use regression analysis in the form of simple regression and multiple regression and elaborates the practical use of regression analysis in the decision making process of which predictor variables should be used in the analysis. Case study 2 presents the use of linear regression techniques in studying the September Sea Ice extent in the Arctic Ocean from year 1979 – 2012. This case study also shows the use of quadratic regression to represent the data with a continuously variable slopes in the regression equation.

Dr. Eng. Fenwicks S Musonye

David Adeabah

sumit verma

Linear Regression Analysis, 2nd edition (Wiley Series in Probability and Statistics) George A. F. Seber, Alan J. Lee Year: 2003 Edition: 2 Language: en Pages: 582

Jamie DeCoster

RELATED PAPERS

Robbi Kurniawan

JOP : Journal of the pancreas

Stefano Serra

Primeiro Congresso da REBRATS

Juliana Alvares Teodoro

turkish journal of sport and exercise

haci murat şahin

Elsa Ramalhosa

Notre Dame Journal of Formal Logic

David Asperó

Revista Ibero-Americana de Humanidades, Ciências e Educação

Pamela Cristina dos Santos

Suzana Nunes Caldeira

Journal of Teaching and Learning Physics

chaerul rochman

Journal of Trauma-injury Infection and Critical Care

Nabil Ebraheim

Renewable Energy

Sucharita Pal

Pedro Miguel Benitez Jimenez

Jerry Flores

Pain Management Nursing

SEViL OLGUN

Christel Müller-Goymann

Physics Letters B

BJOG: An International Journal of Obstetrics & Gynaecology

Jaime Cebreros Lozano

Hans-Georg Petersen

Esther Hernandez

RSC Advances

Mrinalini Puranik

Brian D. Warner

RELATED TOPICS

  •   We're Hiring!
  •   Help Center
  • Find new research papers in:
  • Health Sciences
  • Earth Sciences
  • Cognitive Science
  • Mathematics
  • Computer Science
  • Academia ©2024

Simple Linear Regression

  • First Online: 20 January 2023

Cite this chapter

simple linear regression research paper pdf

  • Daniel P. McGibney 10  

Part of the book series: International Series in Operations Research & Management Science ((ISOR,volume 337))

546 Accesses

In Albert Einstein’s quote above, he stresses the paramount importance of simplicity. In regression analysis, focusing on only two variables demonstrates the concepts simply. Thus, in Chap. 2 , we calculated the least squares line by using two variables. We also plotted scatterplots and calculated correlation coefficients to further assess the linear relationship. From this analysis, we obtained a detailed understanding of the relationship between two variables. Upon understanding a linear relationship, other more complicated processes become easier to grasp.

If you can’t explain it simply, you don’t understand it well enough. —Albert Einstein

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
  • Available as EPUB and PDF
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
  • Durable hardcover edition

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Author information

Authors and affiliations.

Department of Management Science, University of Miami, Coral Gables, FL, USA

Daniel P. McGibney

You can also search for this author in PubMed   Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

McGibney, D.P. (2023). Simple Linear Regression. In: Applied Linear Regression for Business Analytics with R. International Series in Operations Research & Management Science, vol 337. Springer, Cham. https://doi.org/10.1007/978-3-031-21480-6_4

Download citation

DOI : https://doi.org/10.1007/978-3-031-21480-6_4

Published : 20 January 2023

Publisher Name : Springer, Cham

Print ISBN : 978-3-031-21479-0

Online ISBN : 978-3-031-21480-6

eBook Packages : Business and Management Business and Management (R0)

Share this chapter

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

  • Publish with us

Policies and ethics

  • Find a journal
  • Track your research

U.S. flag

An official website of the United States government

The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

  • Publications
  • Account settings

Preview improvements coming to the PMC website in October 2024. Learn More or Try it out now .

  • Advanced Search
  • Journal List
  • Lippincott Open Access

Logo of lwwopen

Linear Regression in Medical Research

Patrick schober.

From the * Department of Anesthesiology, Amsterdam UMC, Vrije Universiteit Amsterdam, Amsterdam, the Netherlands

Thomas R. Vetter

† Department of Surgery and Perioperative Care, Dell Medical School at the University of Texas at Austin, Austin, Texas.

Related Article, see p 110

Linear regression is used to quantify the relationship between ≥1 independent (predictor) variables and a continuous dependent (outcome) variable.

In this issue of Anesthesia & Analgesia , Müller-Wirtz et al 1 report results of a study in which they used linear regression to assess the relationship in a rat model between tissue propofol concentrations and exhaled propofol concentrations (Figure.

An external file that holds a picture, illustration, etc.
Object name is ane-132-108-g001.jpg

Table 2 given in Müller-Wirtz et al, 1 showing the estimated relationships between tissue (or plasma) propofol concentrations and exhaled propofol concentrations. The authors appropriately report the 95% confidence intervals as a measure of the precision of their estimates, as well as the coefficient of determination ( R 2 ). The presented values indicate, for example, that (1) the exhaled propofol concentrations are estimated to increase on average by 4.6 units, equal to the slope (regression) coefficient, for each 1-unit increase of plasma propofol concentration; (2) the “true” mean increase could plausibly be expected to lie anywhere between 3.6 and 5.7 units as indicated by the slope coefficient’s confidence interval; and (3) the R 2 suggests that about 71% of the variability in the exhaled concentration can be explained by its relationship with plasma propofol concentrations.

Linear regression is used to estimate the association of ≥1 independent (predictor) variables with a continuous dependent (outcome) variable. 2 In the most simple case, thus referred to as “simple linear regression,” there is only one independent variable. Simple linear regression fits a straight line to the data points that best characterizes the relationship

between the dependent ( Y ) variable and the independent ( X ) variable, with the y -axis intercept ( b 0 ), and the regression coefficient being the slope ( b 1 ) of this line:

equation image

A model that includes several independent variables is referred to as “multiple linear regression” or “multivariable linear regression.” Even though the term linear regression suggests otherwise, it can also be used to model curved relationships.

Linear regression is an extremely versatile technique that can be used to address a variety of research questions and study aims. Researchers may want to test whether there is evidence for a relationship between a categorical (grouping) variable (eg, treatment group or patient sex) and a quantitative outcome (eg, blood pressure). The 2-sample t test and analysis of variance, 3 which are commonly used for this purpose, are essentially special cases of linear regression. However, linear regression is more flexible, allowing for >1 independent variable and allowing for continuous independent variables. Moreover, when there is >1 independent variable, researchers can also test for the interaction of variables—in other words, whether the effect of 1 independent variable depends on the value or level of another independent variable.

Linear regression not only tests for relationships but also quantifies their direction and strength. The regression coefficient describes the average (expected) change in the dependent variable for each 1-unit change in the independent variable for continuous independent variables or the expected difference versus a reference category for categorical independent variables. The coefficient of determination, commonly referred to as R 2 , describes the proportion of the variability in the outcome variable that can be explained by the independent variables. With simple linear regression, the coefficient of determination is also equal to the square of the Pearson correlation between the x and y values.

When including several independent variables, the regression model estimates the effect of each independent variable while holding the values of all other independent variables constant. 4 Thus, linear regression is useful (1) to distinguish the effects of different variables on the outcome and (2) to control for other variables—like systematic confounding in observational studies or baseline imbalances due to chance in a randomized controlled trial. Ultimately, linear regression can be used to predict the value of the dependent outcome variable based on the value(s) of the independent predictor variable(s).

Valid inferences from linear regression rely on its assumptions being met, including

  • the residuals are the differences between the observed values and the values predicted by the regression model, and the residuals must be approximately normally distributed and have approximately the same variance over the range of predicted values;
  • the residuals are also assumed to be uncorrelated. In simple language, the observations must be independent of each other; for example, there must not be repeated measurements within the same subjects. Other techniques like linear mixed-effects models are required for correlated data 5 ; and
  • the model must be correctly specified, as explained in more detail in the next paragraph.

Whereas Müller-Wirtz et al 1 used simple linear regression to address their research question, researchers often need to specify a multivariable model and make choices on which independent variables to include and on how to model the functional relationship between variables (eg, straight line versus curve; inclusion of interaction terms).

Variable selection is a much-debated topic, and the details are beyond the scope of this Statistical Minute. Basically, variable selection depends on whether the purpose of the model is to understand the relationship between variables or to make predictions. This is also predicated on whether there is informed a priori theory to guide variable selection and on whether the model needs to control for variables that are not of primary interest but are confounders that could distort the relationship between other variables.

Omitting important variables or interactions can lead to biased estimates and a model that poorly describes the true underlying relationships, whereas including too many variables leads to modeling the noise (sampling error) in the data and reduces the precision of the estimates. Various statistics and plots, including adjusted R 2 , Mallows C p , and residual plots are available to assess the goodness of fit of the chosen linear regression model.

IMAGES

  1. (PDF) Linear Regression Analysis Using R for Research and Development

    simple linear regression research paper pdf

  2. Estimated simple linear regression equation

    simple linear regression research paper pdf

  3. Intro to linear regression analysis

    simple linear regression research paper pdf

  4. Summary Table of the Result of Simple Linear Regression Analysis

    simple linear regression research paper pdf

  5. Simple Linear Regression

    simple linear regression research paper pdf

  6. SIMPLE LINEAR REGRESSION

    simple linear regression research paper pdf

VIDEO

  1. 07. Lecture 2.3 simple linear regression practical example on SPSS1 1

  2. Regression Analysis, Simple Regression (Intro) -Chapter 5

  3. Introducing Simple Linear Regression

  4. Simple Linear Regression explained with question paper

  5. Introduction to Linear Models

  6. 4.1.2.4 Lab

COMMENTS

  1. (PDF) Simple Linear Regression

    Simple linear regression estimates the coe fficients b and b of a linear model which predicts the. value of a single dependent variable ( y) against a single independent variable ( x) in the form ...

  2. PDF Chapter 9 Simple Linear Regression

    9.1. THE MODEL BEHIND LINEAR REGRESSION 217 0 2 4 6 8 10 0 5 10 15 x Y Figure 9.1: Mnemonic for the simple regression model. than ANOVA. If the truth is non-linearity, regression will make inappropriate predictions, but at least regression will have a chance to detect the non-linearity.

  3. PDF An Introduction to Simple Linear Regression

    In contrast, if a regressor violates the underlying regression assumption, then it becomes difficult to generalize a model. Table 1-1. Linear Regression Assumptions Assumption Description Linearity there must be a linear relationship between an independent variable and a dependent variable. We verify this assumption using a pairwise scatter plot.

  4. Simple linear regression

    The most basic regression relationship is a simple linear regression. In this case, E ( Y | X) = μ ( X) = β0 + β1X, a line with intercept β0 and slope β1. We can interpret this as Y having a ...

  5. PDF Simple Linear Regression

    Simple linear regression refers to linear regression with one predictor variable and one response variable. In the previous chapter, our primary focus entailed the calculation and fundamental programming concepts of linear regression. The goal of this chapter is to guide the reader through some of the understanding and analysis details of ...

  6. PDF Simple linear regression

    dict the values of one variable using the values of the other. One of. the simplest prediction methods is linear regression, in which we. attempt to find a 'best line' through the data points ...

  7. PDF Simple Linear Regression

    Linear regression is commonly used in applied research. We will explore how to use linear regression for causal effect estimation. To build intuition, we focus on the application of simple linear regression. Linear Regression and Causality. Regression conditional expectation function of Y given X. E(Y j X) = f (X) = >X.

  8. Simple Linear Regression

    Regression allows you to estimate how a dependent variable changes as the independent variable (s) change. Simple linear regression example. You are a social researcher interested in the relationship between income and happiness. You survey 500 people whose incomes range from 15k to 75k and ask them to rank their happiness on a scale from 1 to ...

  9. (PDF) Simple Linear Regression

    The most elementary type of regression model is the simple linear regression model, which can be expressed by the following equation: y t = β 1 + β 2 X t + u t. (1.01) The subscript t is used to index the observations of a sample. The total number of observations, also called the sample size, will be denoted by n.

  10. PDF Statistical Analysis 6: Simple Linear Regression

    1 Statistical Analysis 6: Simple Linear Regression Research question type: When wanting to predict or explain one variable in terms of another What kind of variables? Continuous (scale/interval/ratio) Common Applications: Numerous applications in finance, biology, epidemiology, medicine etc. Example 1: A dietetics student wants to look at the relationship between calcium intake and knowledge about

  11. Simple Linear Regression

    The simple linear regression model has 2 coefficients ( β0 and β1) as shown in Eq. (4.1). In simple linear regression, it is always the case that the degrees of freedom for the SSE are n − 2. From the approximation of the variance, an approximation of the standard deviation can be found by taking the square root:

  12. PDF Using regression analysis to establish the relationship between home

    Home environment and reading achievement research has been largely dominated by a focus on early reading acquisition, while research on the relationship between home environments and reading success with preadolescents (Grades 4-6) has been largely overlooked. There are other limitations as well. Clarke and Kurtz-Costes (1997) argued that prior ...

  13. PDF Machine Learning 1: Linear Regression

    A simple model A linear model that predicts demand: predicted peak demand = 1 (high temperature) + 2 60 65 70 75 80 85 90 95 1.5 2 2.5 3 High Temperature (F) Peak Hourly Demand (GW) Observed data Linear regression prediction Parameters of model: 1; 2 2R ( 1 = 0:046, 2 = 1:46) Stefano Ermon Machine Learning 1: Linear Regression March 31, 2016 8 / 25

  14. Linear Regression Analysis on Net Income of an Agrochemical Company in

    Simple linear regression: Simple linear regression is a model with a single regressor x that has a. relationship with a response y that is a straight line. This simple linear regression. model can be expressed as. y = β0 + β1x + ε. where the intercept β0 and the slope β1 are unknown constants and ε is a random.

  15. PDF SIMPLE LINEAR REGRESSION

    SIMPLE LINEAR REGRESSION Documents prepared for use in course B01.1305, New York University, Stern School of Business Fictitious example, n = 10. Page 3 This shows the arithmetic for fitting a simple linear regression. Summary of simple regression arithmetic page 4 This document shows the formulas for simple linear regression, including

  16. Linear Regression in Medical Research

    Linear regression is used to estimate the association of ≥1 independent (predictor) variables with a continuous dependent (outcome) variable. 2 In the most simple case, thus referred to as "simple linear regression," there is only one independent variable. Simple linear regression fits a straight line to the data points that best characterizes the relationship

  17. Regression: simple linear

    Regression is a statistical term used for describing models that estimate the relationships among variables. Linear regression models study the relationship between a single dependent variable Y an...

  18. PDF Multiple Linear Regression (2nd Edition) Mark Tranmer Jen Murphy Mark

    In both cases, we still use the term 'linear' because we assume that the response variable is directly related to a linear combination of the explanatory variables. The equation for multiple linear regression has the same form as that for simple linear regression but has more terms: = 0 +. 1 +. 2 + ⋯ +.

  19. Using Simple Linear Regression to Assess the Success of the Montreal

    This paper is meant to provide an illustration of this amended recommendation. The data and accompanying analysis presented in this paper provide both a meaningful example of data analysis using simple linear regression and a story of remarkable success of international cooperation addressing a global environmental problem.

  20. Estimation Approach for a Linear Quantile-Regression Model with Long

    The aim of this paper is to assess the significant impact of using quantile analysis in multiple fields of scientific research . Here, we focus on estimating conditional quantile functions when the errors follow a GARMA (Generalized Auto-Regressive Moving Average) model. Our key theoretical contribution involves identifying the Quantile-Regression (QR) coefficients within the context of GARMA ...

  21. Predictive modeling of PV solar power plant efficiency considering

    The multiple linear regression formula as follows: (2) Y i = b 0 + b 1 X 1 + ⋯ + b i X i + ⋯ + b k X k + ɛ, i = 1, 2, …, k In Eq. (2), Y i is the dependent variable observed in multiple linear regression, X i 's are the independent variables, b's are the regression coefficients, ɛ is a fixed number. Using the least squares method ...

  22. An improved regression‐based and observation global maximum power point

    This paper proposes a regression-based P&O MPPT method to find GMPP speedily, avoiding being stuck in local MPP. Generated PVMs on a flexible Python environment have been used in linear and non-linear regression analyses. Regression analyses narrowed search areas to improve the speed of finding MPPT for preventing falling into local maxima.