This is a supplemental reference to aid readers in understanding Kucoach's sevenfortyseven.com articles, written by his alter ego, Nathan Sliver. In case you're unaware, these articles are a parody of Nate Silver's fivethirtyeight.com, which means Ku employs data-based techniques to try to explain our league.
Multivariable Regression
Ku often employs regression as a method of explaining relationships. A simple regression between two variables, say a player's FT rating and a player's FT percentage, draws a line through the observations such that the sum of the squared differences between each point and the line is minimized. In other words, it draws a line through a bunch of points to try to explain the relationship between them. Like so.
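To make the "minimize the squared differences" idea concrete, here's a minimal Python sketch. Nothing in it is league data; the FT numbers and the 0.45/0.04 relationship are fabricated so the fit has something to find.

    import numpy as np

    rng = np.random.default_rng(0)
    ft_rating = rng.integers(1, 11, size=50)                 # hypothetical 1-10 ratings
    ft_pct = 0.45 + 0.04 * ft_rating + rng.normal(0, 0.05, 50)

    # polyfit with degree 1 finds the slope and intercept that minimize
    # the sum of squared vertical distances from the points to the line
    slope, intercept = np.polyfit(ft_rating, ft_pct, 1)
    print(f"FT% ~ {intercept:.3f} + {slope:.3f} * FT rating")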
When the model includes more than one explanatory variable (e.g. a player's inside, jump shooting, free throw, and 3 point ratings explaining offensive efficiency), our simple human minds can't really visualize it that well, because it requires more than the 2 or 3 dimensions our brains can handle, so we resort to numbers.
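Since we're resorting to numbers anyway, here's a hedged sketch of what fitting that kind of model looks like in Python with statsmodels. The ratings are simulated and the column names are my own placeholders, but the summary() printout contains rough analogues of every table discussed below.

    import numpy as np
    import pandas as pd
    import statsmodels.api as sm

    rng = np.random.default_rng(1)
    n = 200
    df = pd.DataFrame({
        "inside": rng.integers(1, 11, n),    # placeholder rating columns
        "jumper": rng.integers(1, 11, n),
        "ft": rng.integers(1, 11, n),
        "three": rng.integers(1, 11, n),
    })
    # fake offensive efficiency driven by the four ratings plus noise
    df["off_eff"] = (100 + 1.5 * df["inside"] + 1.0 * df["jumper"]
                     + 0.5 * df["ft"] + 0.8 * df["three"] + rng.normal(0, 3, n))

    X = sm.add_constant(df[["inside", "jumper", "ft", "three"]])
    model = sm.OLS(df["off_eff"], X).fit()
    print(model.summary())   # fit stats, F test, and coefficient estimates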
Summary of Fit Table
Really the only number I'll ever reference in this table is the adjusted R squared. This number tells you how much of the variation in the dependent variable (offensive efficiency) is explained by the independent variables in the model. In the above example, FT rating explains 70% of the variance observed in FT percentage. If the model is good, then the remainder of the variance is explained by randomness; however, it is often the case that the remaining variation is explained by omitted variables (in this case, maybe strength, stamina, and whether the FT was shot at home). The adjusted R squared is preferable to the plain R squared because it penalizes the model for including irrelevant variables, whereas the R squared will always increase when you add a new variable, whether relevant or not.
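To see that penalty in action, here's a small sketch: bolt a pure-noise column onto a model and the plain R squared creeps up while the adjusted R squared doesn't reward the junk. All numbers are fabricated.

    import numpy as np
    import statsmodels.api as sm

    rng = np.random.default_rng(2)
    n = 100
    ft_rating = rng.integers(1, 11, n)
    ft_pct = 0.45 + 0.04 * ft_rating + rng.normal(0, 0.05, n)
    junk = rng.normal(size=n)                     # irrelevant variable

    base = sm.OLS(ft_pct, sm.add_constant(ft_rating)).fit()
    padded = sm.OLS(ft_pct,
                    sm.add_constant(np.column_stack([ft_rating, junk]))).fit()

    print(base.rsquared, base.rsquared_adj)
    print(padded.rsquared, padded.rsquared_adj)   # R^2 never drops; adj R^2 can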
Analysis of Variance Table
The only number you need to worry about here is the p-value reported alongside the F stat (usually labeled Prob > F). If this number is very small, less than 0.05, then the model as a whole is doing real explanatory work, i.e. the explanatory variables taken together explain the dependent variable better than a model with no variables at all would.
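In statsmodels terms (assuming your software reports an equivalent field), that small number is exposed as f_pvalue, separate from the F stat itself:

    import numpy as np
    import statsmodels.api as sm

    rng = np.random.default_rng(3)
    n = 150
    X = rng.normal(size=(n, 3))
    y = X @ np.array([1.0, 0.5, -0.7]) + rng.normal(0, 1, n)

    model = sm.OLS(y, sm.add_constant(X)).fit()
    # a big F stat and a tiny p-value mean the model is doing real work
    print(model.fvalue, model.f_pvalue)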
Residuals by Predicted Plot
On occasion I will include a residuals by predicted plot. With this plot we want the pattern of points to look random. If there is any sort of pattern or trend to the points, it suggests the data are skewed or the model is poorly specified.
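If you want to draw one yourself, here's a sketch with matplotlib on fake data; since the model below is correctly specified, the cloud should come out patternless.

    import numpy as np
    import matplotlib.pyplot as plt
    import statsmodels.api as sm

    rng = np.random.default_rng(4)
    n = 200
    x = rng.normal(size=(n, 2))
    y = 2 + x @ np.array([1.0, -0.5]) + rng.normal(0, 1, n)

    model = sm.OLS(y, sm.add_constant(x)).fit()
    plt.scatter(model.fittedvalues, model.resid, s=10)   # residuals vs predicted
    plt.axhline(0, color="gray", linewidth=1)
    plt.xlabel("Predicted")
    plt.ylabel("Residual")
    plt.show()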
Variable/Indicator Parameterization Function Table
This is the most important part of the results. This table describes the relationships between the explanatory variables and the dependent variable. This is best explained with an example.
Let’s look at the estimate for cohesion. You can essentially think of this as a coefficient in a linear equation, or the marginal effect of each variable. In this case, the estimate is 0.033. This means that if cohesion increases by one level, then we would expect a 0.033 increase in winning percentage, on average and holding all else equal, i.e. if two teams were exactly the same but one team had good cohesion and the other had average, we would expect the team with good cohesion to have a winning percentage 0.033 higher.

This is just an estimate, however. Now look over at the lower and upper 95% columns. If we were to gather fresh data and rerun this model 100 times, the 95% interval we compute (here, 0.014 to 0.052) would capture the true effect of cohesion about 95 of those times. If zero is in that range, then our estimate is probably not good enough to talk about, because we’re not sure that the variable has any effect at all. When zero is in the 95% confidence interval, the p-value (prob>|t|) will be greater than 0.05, which is why we are really only interested in estimates whose p-value is lower than 0.05.
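Here's a sketch of pulling those same columns (estimate, 95% bounds, p-value) out of a fitted model. "cohesion" and "chemistry" are stand-in names and the data are simulated; the 0.033 in the text comes from the article's own table, not from this code (though the simulation is rigged to land near it).

    import numpy as np
    import pandas as pd
    import statsmodels.api as sm

    rng = np.random.default_rng(5)
    n = 300
    df = pd.DataFrame({
        "cohesion": rng.integers(1, 6, n),    # stand-in 1-5 cohesion levels
        "chemistry": rng.integers(1, 6, n),
    })
    df["win_pct"] = (0.35 + 0.033 * df["cohesion"] + 0.010 * df["chemistry"]
                     + rng.normal(0, 0.08, n))

    model = sm.OLS(df["win_pct"],
                   sm.add_constant(df[["cohesion", "chemistry"]])).fit()
    ci = model.conf_int()                     # columns 0 and 1 = lower/upper 95%
    table = pd.DataFrame({
        "estimate": model.params,
        "lower95": ci[0],
        "upper95": ci[1],
        "p": model.pvalues,
    })
    print(table[table["p"] < 0.05])           # keep only estimates we can trust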
Well, we'll start with that for now.