Comments

You must log in or register to comment.

MrBookman_LibraryCop OP t1_jbi6ajh wrote

I ran a linear regression model to estimate the % difference in income levels between men and women by occupation based on the variables listed in the next paragraph. The model also includes Unpaid domestic work, but it was insignificant. I then used the model to predict the difference in income levels if all independent variables were equal - this is the equivalised difference in the chart (in blue).

Data used for the graph and linear model are from the Australian 2021 Census of Population and Housing. They are Total personal income (weekly), Occupation at the 4-digit ANZSCO level, Unpaid child care, Level of highest educational attainment, Hours worked and Volunteer Status.

Note that the model has an intercept of -0.018 which is highly significant (p < 6.08e-07), suggesting that there remains a structural difference in mean income levels after accounting for the factors listed above; even if domestic duties, hours worked etc. were all the same, there would still be a difference in income levels due to other factors.

Plot generated in R using ggplot2

4

WaterAirFireEarth t1_jbi8fmd wrote

I mean the data seems meaningful, but honestly it is certainly not beautiful to me.

21

bellingman t1_jbi8hl1 wrote

Averages please? Ideally weighted by population size?

For each group, and overall? Thanks in advance.

2

bellingman t1_jbi8mwp wrote

Saying "there is a difference" without giving the amount is maddeningly incomplete.

You should also include possible confounding variables that were not controlled for.

9

ibeerianhamhock t1_jbichoh wrote

Sounds interesting, but it is painful to try to read and digest. I think even if you just break it up a bit, that would be a good start.

2

WaterstarRunner t1_jbidfq5 wrote

That's what I assumed, but was more curious about the correction methodology.

Given the presumably high inverse correlation between hours worked and domestic hours worked, I'd assume that there's a risk of double-correction.

However it'd be awesome to see whether the impact of domestic hours worked has a bigger impact in certain job types over others, based on "uncounted working hours" which typically accompany higher paid roles.

4

bellingman t1_jbifpnb wrote

The lines are a little misleading, because it looks like a "range". And the visual weight is hugely below zero, so it appears to show that the wage gap is hugely negative.

Separating the "equivalized" numbers into their own chart would make clear that they dance pretty evenly around both sides of zero.

(Exactly how close to zero, we are evidently supposed to guess, as you mysteriously did not include the overall average!?)

2

BiChiSquare t1_jbis80i wrote

Someone needs to forward this to J. Peterson.

−1

jtravers887 t1_jbj7gmf wrote

What's the data source?

Remember when the first comment below a viz was a link to the data source? The good ol' days...

2

[deleted] t1_jbm6jwd wrote

So red is the observed difference and blue the observed difference after it has been adjusted?

Is linear regression the right analysis?

How did you determine which adjustment factors to use and the weight to assign to them?

1