Finance and Economics Discussion Series: Accessible versions of figures for 2020-036

Should Children Do More Enrichment Activities? Leveraging Bunching to Correct for Endogeneity

Accessible version of figures


Figure 1: Daily Time Breakdown
Note: Panel plots the average division of enrichment time into different sub-categories over a typical week. The figure pools the 1997, 2002 and 2007 CDS waves.

Share of Week Enrichment Broad Enrichment Active Leisure Passive Leisure Duties/Chors Class Sleep Remainder
Share of Week 0.0311 0.0240 0.0732 0.1041 0.1469 0.1843 0.4000 0.0364

Return to text


Figure 2: Enrichment Time Breakdown
Note: Panel plots the average division of enrichment time into different sub-categories over a typical week. The figure pools the 1997, 2002 and 2007 CDS waves.

Share of Enrichment Read Book Other Reading Being Read to Homework Before/After School Other Education Other Classes Non-Academic Lessons
Share of Enrichment 0.1409 0.0428 0.0138 0.6577 0.0747 0.0189 0.0184 0.0328

Return to text


Figure 3: Evidence of Bunching at Zero Enrichment Time
Note: Figure plots the cumulative density function of time spent per week on enrichment activities (in hours) for our full sample.

Figure plots the cumulative density function (cdf) of time spent per week on enrichment activities (in hours) for our full sample. The cdf shows a mass of about 30 percent at 0, increaseses in a concave way, with the median at around 5 hours and the 80th percentile at 10 hours. The x-axis runs between 0 and 50 hours.

Return to text


Figure 4: Evidence that Bunching is Selective
Note: Each panel shows a plot of the local linear estimator of the expected value of a variable conditional on enrichment time, along with its 90% confidence interval. The expected value of the variable among the children who spent no time on enrichment is also shown, along with its 90% confidence interval. Finally, the $$p$$-value of a test for whether there is discontinuity at zero is shown in the header of each panel.

Each panel shows a plot of the local linear estimator of the expected value of a variable conditional on enrichment time, along with its 90 percent confidence interval. The expected value of the variable among the children who spent no time on enrichment is also shown, along with its 90 percent confidence interval. Finally, the p-value of a test for whether there is discontinuity at zero is shown in the header of each panel. The upper left panel (child is black) shows a downward, concave relationship on the positive side with the point at 0 starkly higher (p-value = 0.017). The upper right panel (child is male) shows a clear, negative relationship on the positive side with the point at 0 starkly higher (p-value = 0.003). The middle left panel (mother is employed full time) shows a roughly flat relationship on the positive side with the point at 0 starkly higher (p-value = 0.006). The middle right panel (mother was married at birth) shows a positive relationship on the positive side with the point at 0 starkly lower (p-value = 0.036). The bottom left panel (child attends private school) shows a clear positive relationship on the positive side with the point at 0 starkly lower (p-value = 0.022). The bottom right panel (passive leisure) shows a strong negative relationship on the positive side with the point at 0 starkly higher (p-value = 0.000).

Return to text


Figure 5: Relationship Between Child's Ability and Enrichment Time
Note: In the left panel, the solid line denotes actual (chosen) enrichment, which is equal to desired enrichment when desired enrichment is non-negative. For negative desired enrichment (dashed line), actual enrichment must be zero. The right panel inverts this relation, showing that this constraint will generate a discontinuity in the expected characteristics of children who do zero enrichment, since that group includes all the children for whom the constraint is binding. $$C$$ represents all other characteristics that determine enrichment (observed or unobserved).

In the left panel, the solid line denotes actual (chosen) enrichment, which is equal to desired enrichment when desired enrichment is non-negative. The relationship is positive for this portion of the figure. For negative desired enrichment (dashed line, continuing to show a positive relationship), actual enrichment must be zero. The right panel inverts this relation, showing that this constraint will generate a discontinuity in the expected characteristics of children who do zero enrichment, since that group includes all the children for whom the constraint is binding. In particular, on the positive size desired enrichment and ability are positively related, and the average ability at 0 enrichment is discontinuously lower. C represents all other characteristics that determine enrichment (observed or unobserved).

Return to text


Figure 6: Cognitive and Non-Cognitive Skills Production - High School
Note: This figure illustrates a potential explanation for our findings of zero net effects on cognitive skills and negative net effects on non-cognitive skills for youth in high school. The top curve shows how cognitive skills vary causally with enrichment. The lower curve shows the analogous relationship for non-cognitive skills. $$I_c$$ is the level of enrichment that maximizes cognitive skills, and $$I_{nc}$$ is the level of enrichment that maximizes non-cognitive skills. Around $$I_c$$, the net effect of enrichment on cognitive skills is close to zero and its corresponding effect on non-cognitive skills is negative.

This figure illustrates a potential explanation for our findings of zero net effects on cognitive skills and negative net effects on non-cognitive skills for youth in high school. The figure shows two concave curves where the x-axis in enrichment time and the y-axis is skill. Both curves start at 0, increase to a maximum, and then decline, with one curve sitting inside the other. The top curve shows how cognitive skills vary causally with enrichment. The lower curve shows the analogous relationship for non-cognitive skills. I_c is the level of enrichment that maximizes cognitive skills, corresponding the peak of the top curve, and I_{nc} is the level of enrichment that maximizes non-cognitive skills. Around I_c, the net effect of enrichment on cognitive skills is close to zero and its corresponding effect on non-cognitive skills is negative.

Return to text


Figure 7: Enrichment Time Breakdowns by Grade Level
Note: Panels plot the average division of time into different categories over a typical week for each grade level. The figure pools the 1997, 2002 and 2007 CDS waves.

Grade Level Read Book Other Reading Being Read to Homework Before/After School Other Education Other Classes Non-Academic Lessons
Grades PreK-5 0.1589 0.0497 0.0317 0.5155 0.1773 0.0084 0.0194 0.0391
Grades 6-8 0.1608 0.0478 0.0086 0.6619 0.0490 0.0165 0.0153 0.0401
Grades 9-12 0.1049 0.0316 0.0023 0.7852 0.0047 0.0310 0.0204 0.0199

Return to text


Figure 8: Enrichment Time by Income Tercile: Grades 9-12
Note: This Figure shows the empirical cumulative distribution functions of enrichment time for each household income tercile, among high school children.

This figure shows the empirical cumulative distribution functions of enrichment time for each household income tercile, among high school children. The medium- and low-income cdfs lie practically on top of each other, while the top-tercile cdf lies notably below both everywhere.

Return to text


Figure 9: Child Time Use by Enrichment Time: Low-Income Children in Grades 9-12
Note: Each panel shows a plot of the local linear estimator of the expected value of a variable for a given amount of time spent on enrichment, along with its 90% confidence interval. The expected value of the variable for the children who spent no time on enrichment is also shown, along with its 90% confidence interval. Finally, the $$p$$-value of a test for whether there is discontinuity at zero time on enrichment is also shown in the header of each panel.

Each panel shows a plot of the local linear estimator of the expected value of a variable for a given amount of time spent on enrichment, along with its 90 percent confidence interval. The expected value of the variable for the children who spent no time on enrichment is also shown, along with its 90 percent confidence interval. Finally, the p-value of a test for whether there is discontinuity at zero time on enrichment is also shown in the header of each panel. The upper left panel (play and social activities) shows a negative relationship on the positive side with the point at 0 starkly higher (p-value = 0.01). The upper right panel (passive leisure) shows a clear, negative relationship on the positive side with the point at 0 that is contiguous with the positive-side relationship (p-value = 0.399). The middle left panel (duties/chores) shows a modest negative relationship on the positive side with the point at 0 contiguous with the positive-side relationship (p-value = 0.358). The middle right panel (other enrichment activities) shows a roughly flat relationship on the positive side with the point at 0 starkly higher (p-value = 0.056). The bottom left panel (sleeping) shows a clear negative relationship on the positive side with the point at 0 that is contiguous with the postive side relationship (p-value = 0.447). The bottom right panel (class) shows an ambiguous relationship on the positive side with the point at 0 starkly lower (p-value = 0.000).

Return to text


Figure 10: Child Time Use by Enrichment Time: Middle-Income Children in Grades 9-12
Note: Each panel shows a plot of the local linear polynomial estimator of the expected value of a variable for a given amount of time spent on enrichment, along with its 90% confidence interval. The expected value of the variable for the children who spent no time on enrichment is also shown, along with its 90% confidence interval. Finally, the $$p$$-value of a test for whether there is discontinuity at zero time on enrichment is also shown in the header of each panel.

Each panel shows a plot of the local linear polynomial estimator of the expected value of a variable for a given amount of time spent on enrichment, along with its 90\% confidence interval. The expected value of the variable for the children who spent no time on enrichment is also shown, along with its 90% confidence interval. Finally, the p-value of a test for whether there is discontinuity at zero time on enrichment is also shown in the header of each panel. The upper left panel (play and social activities) shows a negative relationship on the positive side with the point at 0 contiguous with the postive side relationship (p-value = 0.38). The upper right panel (passive leisure) shows a clear, negative relationship on the positive side with the point at 0 that is starkly higher (p-value = 0.025). The middle left panel (duties/chores) shows a modest negative relationship on the positive side with the point at 0 contiguous with the positive-side relationship (p-value = 0.338). The middle right panel (other enrichment activities) shows a negative relationship on the positive side with the point at 0 starkly lower (p-value = 0.060). The bottom left panel (sleeping) shows a negative relationship on the positive side with the point at 0 that is starkly higher (p-value = 0.032). The bottom right panel (class) shows a flat relationship on the positive side with the point at 0 starkly lower (p-value = 0.000).

Return to text


Figure 11: Time Breakdowns - Other Time Aggregates

Other Enrichment Activities Computer (Educational) Volunteering Structured Sports Arts Art Excursions
Time Breakdown 0.0897 0.0138 0.6144 0.1690 0.1130

Return to text


Passive Leisure Television Other Media Other Leisure
Time Breakdown 0.6685 0.3039 0.0276

Return to text


Play and Social Activities Games Hobbies Sports Religious Groups Other Groups Conversation Socializing
Time Breakdown 0.4351 0.0643 0.0480 0.1192 0.0276 0.1727 0.1331

Return to text


Duties/Chores Care of Others Chores Paid Work Travelling Shopping Personal Care Meals
Time Breakdown 0.0113 0.1138 0.0553 0.2804 0.0639 0.2100 0.2654

Return to text


Figure 12: Enrichment Time Breakdowns - High School, By Income Tercile
Note: Panels plot the average division of time into different categories over a typical week for each income tercile among those in grades 9-12. The figure pools the 1997, 2002 and 2007 CDS waves.

Income Tercile Read Book Other Reading Being Read to Homework Before/After School Other Education Other Classes Non-Academic Lessons
Low-Income 0.1006 0.0269 0.0011 0.7699 0.0085 0.0545 0.0335 0.0051
Medium-Income 0.0909 0.0236 0.0017 0.8044 0.0055 0.0302 0.0177 0.0261
High-Income 0.1138 0.0385 0.0032 0.7816 0.0026 0.0205 0.0158 0.0240

Return to text


Figure 13: Child Activities by Enrichment Time: High-Income Children in Grades 9-12
Note: Each panel shows a plot of the local linear estimator of the expected value of a variable for a given amount of time spent on enrichment, along with its 90% confidence interval. The expected value of the variable for the children who spent no time on enrichment is also shown, along with its 90% confidence interval. Finally, the $$p$$-value of a test for whether there is discontinuity at zero time on enrichment is also shown in the header of each panel.

Each panel shows a plot of the local linear estimator of the expected value of a variable for a given amount of time spent on enrichment, along with its 90\% confidence interval. The expected value of the variable for the children who spent no time on enrichment is also shown, along with its 90\% confidence interval. Finally, the p-value of a test for whether there is discontinuity at zero time on enrichment is also shown in the header of each panel. The upper left panel (play and social activities) shows a negative relationship on the positive side with the point at 0 contiguous with the postive side relationship (p-value = 0.462). The upper right panel(passive leisure) shows a clear, negative relationship on the positive side with the point at 0 that is contiguous with the positive-side relationship (p-value = 0.195). The middle left panel (duties/chores) shows a negative relationship on the positive side with the point at 0 contiguous with the positive-side relationship (p-value = 0.246). The middle right panel (other enrichment activities) shows a roughly flat relationship on the positive side with the point at 0 contiguous with the positive-side relationship (p-value = 0.377). The bottom left panel (sleeping) shows a negative relationship on the positive side with the point at 0 that is starkly higher (p-value = 0.019). The bottom right panel (class) shows a flat relationship on the positive side with the point at 0 starkly lower (p-value = 0.000).

Return to text


Figure 14: Evidence that Standard Estimates May be Biased Upward
Note: Each panel shows a plot of the local linear polynomial estimator of the expected value of the residuals from equation (1), estimated on the positive enrichment subsample, conditional on enrichment time, along with its 90% confidence interval. The expected value of the residuals for the children who spent no time on enrichment is also shown, along with its 90% confidence interval. Finally, the $$p$$-value of a test for whether there is discontinuity at zero is also shown in the header of each panel.

Each panel shows a plot of the local linear polynomial estimator of the expected value of the residuals from equation \eqref{eq: naive}, estimated on the positive enrichment subsample, conditional on enrichment time, along with its 90\% confidence interval. The expected value of the residuals for the children who spent no time on enrichment is also shown, along with its 90\% confidence interval. Finally, the p-value of a test for whether there is discontinuity at zero is also shown in the header of each panel. For both the left panel (cognitive skills) and the right panel (non-cognitive skills), the relationship on the positive side is roughly flat, with the point at 0 starkly lower (p-value 0.003 and 0.06, respectively).

Return to text


Figure 15: Estimates for Different Sub-samples of the Data - Full Sample
Note: Each panel shows the estimate of $$\beta $$ for cognitive (left panel) or non-cognitive skills (right panel) restricting the sample to only children whose enrichment hours are lower than or equal to $$I_{\max}$$ for values of $$I_{\max}$$ ranging from 1 (only those who chose $$I=0$$ or $$I=1$$) to 50 (everyone). These plots suggest that our main findings are not an artifact of the linearity in $$\eta $$ assumption.

Each panel shows the estimate of beta for cognitive (left panel) or non-cognitive skills (right panel) restricting the sample to only children whose enrichment hours are lower than or equal to I_{max} for values of I_{max} ranging from 1 (only those who chose I=0 or I=1) to 50 (everyone). These plots suggest that our main findings are not an artifact of the linearity in eta assumption, as the estimates are similar to the "everyone" case for all but the most restricted samples.

Return to text


Figure 16: Estimates for Different Sub-samples of the Data - Grades 9-12
Note: Each panel shows the analogous estimates to Figure 15 but for the sub-sample of children in high school.

Each panel shows the analogous estimates to Figure 15 but for the sub-sample of children in high school. These panels are qualitatively the same as those in Figure 15.

Return to text


Figure 17: Evidence of heteroskedasticity on the distribution of $$I\vert X$$
Note: Each curve depicts the CDF of $$I$$ for white and Hispanic high school students. The curves show evidence of heteroskedasticity in the distribution of desired enrichment for different values of the controls.

Each curve depicts the CDF of I for white and Hispanic high school students. The curves show evidence of heteroskedasticity in the distribution of desired enrichment for different values of the controls.

Return to text


Figure 18: Homoskedastic Tobit Fit
Note: Each panel depicts the CDF of enrichment ($$I$$) for white (left panel) and Hispanic (right panel) high school students presented in Figure 17 (thick curve) along with the corresponding homoskedastic Tobit fit (thin curve). The plots show evidence that the homoskedastic normal fit for positive values of enrichment is not satisfactory, which may lead us to over-estimate the magnitude of $$\mathbb{E}[I^*\vert I^*\leq 0,X]$$ and underestimate the magnitude of $$\delta $$.

Each panel depicts the CDF of enrichment (I) for white (left panel) and Hispanic (right panel) high school students presented in Figure 17 (thick curve) along with the corresponding homoskedastic Tobit fit (thin curve). The plots show evidence that the homoskedastic normal fit for positive values of enrichment is not satisfactory, which may lead us to over-estimate the magnitude of the censored expectation and underestimate the magnitude of delta.

Return to text


Figure 19: Heteroskedastic Tobit Fit
Note: Each panel depicts the CDF of enrichment ($$I$$) for white (left panel) and Hispanic (right panel) high school students presented in Figure 17 (thick curve) along with the corresponding heteroskedastic Tobit fit (thin curve). The plots show evidence of a better fit than the homoskedastic case for positive values of enrichment. The tails seem to be fatter in the empirical distribution in comparison with the fit, which may lead us to somewhat under-estimate the magnitude of $$\mathbb{E}[I^*\vert I^*\leq 0,X]$$ and over-estimate the magnitude of $$\delta $$.

Each panel depicts the CDF of enrichment (I) for white (left panel) and Hispanic (right panel) high school students presented in Figure 17 (thick curve) along with the corresponding heteroskedastic Tobit fit (thin curve). The plots show evidence of a better fit than the homoskedastic case for positive values of enrichment. The tails seem to be fatter in the empirical distribution in comparison with the fit, which may lead us to somewhat under-estimate the magnitude of the censored expectation and over-estimate the magnitude of delta.

Return to text


Figure 20: Heteroskedastic Symmetric Fit
Note: Each panel depicts the CDF of $$I$$ for white (left panel) and Hispanic (right panel) high school students presented in Figure 17 (thick curve) along with the corresponding heteroskedastic Symmetric fit (thin curve).

Each panel depicts the CDF of I for white (left panel) and Hispanic (right panel) high school students presented in Figure 17 (thick curve) along with the corresponding heteroskedastic Symmetric fit (thin curve).

Return to text


Figure 21: Uncorrected and Corrected Cognitive and Non-Cognitive Estimates
Note: Left figure shows cognitive estimates, and right figure shows non-cognitive estimates. Shaded areas depict the 90% confidence intervals. All standard errors are bootstrapped using 500 iterations.

Left figure shows cognitive estimates, and right figure shows non-cognitive estimates. Shaded areas depict the 90% confidence intervals. All standard errors are bootstrapped using 500 iterations. Both panels show that the corrected and un-corrected estimates do not change very much as the number of clusters is increased.

Return to text