Dr. Codd Was Right: A Model Citizen

Kent State Leads by 2 - Let's Go MAGA!! Ya couda had #3 on 2/2

[T]hey've got to draw in their horns and stop their aggression, or we're going to bomb them back into the Stone Age. And we would shove them back into the Stone Age with Air power or Naval power -- not with ground forces.
-- Curtis LeMay/1965 [didn't work in Nam, won't work in Iran; these boots are made for walkin]

We'll smash down your doors, we don't bother to knock
We've done it before, so why all the shock?
We're the biggest and the toughest kids on the block
And we're the cops of the world, boys
We're the cops of the world
-- Phil Ochs/1966 [from back when Americans gave a shit]

There's not a single example of things working out for the appeaser.
-- Nicolle Wallace/2024 [like this? the next extortion is on the way]

Why should I go to that cemetery? It's filled with losers.
-- Batshit J. Moron/2018 [I guess Iran War Dead are losers, too?]

Effective with the 2026 mid-term elections, military proctors will be stationed at every shithole Blue city polling place, demanding to see a current, valid United States of Alabama passport. No passport, no vote. I am the dictator.
-- Donald J. Trump by Executive Order, this day 7 December 2025 [let's not wait that long and Chris Murphy and this and Steve]
[I]t's election subversion, not cancellation, that is the real authoritarian move. The goal is to keep elections going but without unseating those in power.
-- Sean Morales-Doyle/2026 [Ayatollah 49% Don^©: "I won with 98.6% of the vote!!]

I have had to explain and re-explain and re-explain and re-explain, you know, how relational databases work, what is an eigenvector, what is dimensionality reduction.

-- Christopher Wylie/2018

I believe quite strongly that, if you think about the issue at the appropriate level of abstraction, you're inexorably led to the position that databases must be relational.

-- Chris Date/2009

This Week's thought

This is malignant narcissism flavored with insane Nobel Peace Prize-related self-pity, the usual Trumpian unfitness magnified by the excitement of his Venezuelan intervention and the vicissitudes of old age, with the entire NATO alliance imperiled by the warmongering whims of its leading power's would-be Caesar.
-- Ross Douthat/2026 [well... Caligua. is MAGA going, going, gone?]

See you next week in a brand new show^{©Heckle and Jeckle}

Therefore:

In a time of SSD, multi-core/processor, two terabyte memory and Optane App Direct Mode (RIP) machines, there is no reason not to build from BCNF data. Time to do what Dr. Codd demonstrated. Technology has finally caught up with the maths.

11 October 2011

A Model Citizen

While it is gratifying to be published by Simple Talk, so many more eyes that way, it isn't a platform where I can continue to prattle on at will. Each piece they publish, most of the time, is a stand alone effort. Since the piece was already rather long, there was one tangent I elected not to include, since it is a separate issue from the task being discussed.

"That subject: cleavages." Well, I only wish (and if you know from whence that quote came, bravo). No, alas, the topic is what to do with regard to fully understanding "bang for the buck". I elided that in the piece, since the point was to show that a useful stat graphic could be generated from the database. But how to discover the "true" independent variables of electoral primacy, and their magnitude? Could it be that with all the data we might have, both for free on the intertubes and costly which we generate, our best model is only 30% predictive? To reiterate, the exercise isn't to predict who'll win (FiveThirtyEight has been spectacular), but rather which knobs and switches a given organizations can manipulate to *change* a losing situation.

If you'll recall, most of the explanatory variables weren't of a continuous nature, that is, real numbers. The fitted lines in the scatterplots used a variation on simple linear regression to fit. The variation dealt with the differing best slopes over ranges. The technique doesn't account for the fact that most of the explanatory variables are either categorical (yes/no) or discrete (strongly disagree to strongly agree).

For this kind of mixed data regression, one typically uses analysis of covariance (aka, ancova). R, as one would expect, provides this. The Crawley book devotes a full chapter to ancova. I'll direct you there. Some say that discrete independent variables can be used directly in simple linear regression. Others would run to ANOVA immediately. Some distinguish categorical variables (gender) from discrete scaled variables (the 5 point agree scale on gun control). It is, suffice to say, not a slam dunk any way you go.

Exploratory data analysis, what R is particularly good at, is where the apparatchiks should be spending much of their effort (not worrying about the entrails of Rails!). Assuming that money is the driver of winning is an assumption, frequently wrong in the real world. Since their organization is large, national in scope, and full of dollars to spend; spelunking through all available data is the directive. That assumes, of course, that winning elections, without regard to policy positions, is the goal. Think of selling nappies.

While the goal of the piece was to display something simple to the Suits, determining a more accurate predictive model, which will be implemented with traditional text output, is the real goal. Same is true of selling nappies. The analogy is not so far fetched, as this book demonstrates; there have been similar treatises in the years since.

Dr. Codd Was Right

Kent State Leads by 2 - Let's Go MAGA!! Ya couda had #3 on 2/2

About

Shameless Plug

Extended Pieces

Good Stuff

Followers

Blog Archive

11 October 2011

A Model Citizen

No comments: