upvote
How hard have you thought about this?

The biggest challenge with running a census is getting people to trust you enough to answer your questions.

A lot of census questions are sensitive. The ACS covers topics like citizenship status, disabilities, income, SNAP assistance, languages spoken at home.

If you want accurate information about the people who live in your country you need the census process to feel as safe for people to respond to as possible.

Are you saying the census shouldn't collect any data that people wouldn't be comfortable publishing? Because that's a recipe for a census that is far less useful for helping the country make useful decisions.

reply
> Are you saying the census shouldn't collect any data that people wouldn't be comfortable publishing? Because that's a recipe for a census that is far less useful for helping the country make useful decisions.

I'll say that. The state representatives should provide congress and the president any data needed to inform policy decisions about the people they represent. And as others have pointed out, other departments and agencies (such as the IRS) have most of the rest of the data required to make policy decisions.

Except for gerrymandering purposes, I fail to see why income, party affiliations, etc., is useful for the purpose the census was created for.

reply
The census doesn't collect party affiliations.

https://www.census.gov/topics/public-sector/voting/about/faq...

> the CPS Voting and Registration Supplement does not ask any questions of a partisan nature.

reply
>And as others have pointed out, other departments and agencies (such as the IRS) have most of the rest of the data required to make policy decisions.

There are laws in place forbidding government agencies from merging together datasets.

The last thing people should support is creating of profiles of individuals by combining data from different government agencies. This is why the census is so important as a data collection mechanism.

reply
> There are laws in place forbidding government agencies from merging together datasets.

This is an excellent point. In my opinion, such laws are a good idea. Most of the time, policy decisions should not require IRS data. (Or other personal data.)

But to get around such laws, the government asks citizens to provide that data a second time (in the census). And sometimes it's asked yet again on other forms. This seems to defeat the purpose of those laws.

I can see that federal disaster aid might need to know if some area needs more or less aid, depending on the wealth of the area receiving aid. If aid is given to individuals, the have a need to know the individuals' income.

When there is a reasonable need to know, I would prefer the government use the much more accurate IRS data, rather than ask for people's income multiple times. The laws preventing merging federal datasets could be rethought, given what is now known about preserving privacy mathematically. I would like to see specific exemptions made, with the provided data properly anonymized to preserve privacy while serving the legitimate purpose for which the data was requested. The use of such data should require a request to congress for it.

reply
This seems’s like an issue created by congress. the constitution only requires a headcount by state. Maybe they should use another mechanism to collect demographic data. Since the concern is not about representation, but allocation, tax returns seem like an obvious alternative and they are already private and collected at a much more granular level.
reply
I don't think the question "Has this person given birth to any children in the past 12 months?" would look good on a tax return.
reply
Have you filled out a federal income tax return in the US?

It absolutely asks for the names (and SSN) of any dependents. It's trivial to infer whether one of the adult(s) filing the tax return gave birth in the last 12 months based on the last 2 years of tax returns for those adult(s).

reply
My home country pays a baby bonus to people and it's administered via the tax system, so I think we ask something very similar actually.
reply
The census isn't for helping the country make any decisions other than determining the number of representatives and apportionment of taxes. It should not be collecting any data that isn't necessary for that.
reply
https://constitution.congress.gov/browse/article-1/section-2...

> The actual Enumeration shall be made within three Years after the first Meeting of the Congress of the United States, and within every subsequent Term of ten Years, in such Manner as they shall by Law direct.

The key thing you're missing is "in such Manner as they shall by Law direct".

Congress has passed a whole bunch of laws that attach additional responsibilities to the census for the purpose of supporting government decisions.

The Permanent Census Office Act of 1902 for example, which established the census office and tacked on "an annual survey of cotton production, and other economic censuses" https://www.census.gov/about/history/historical-censuses-and...

reply
That's not true, they also wanted to get an understanding of who they were governing.
reply
I'd like to know when they stopped publishing census data. I have used it for genealogical purposes to track ancestors: you can see exactly who was living in which house, how they are related, and what their ages are (I found that women in my family often reported, both on the census and marriage documents, being younger than they actually were). I don't think I've seen data from after 1950, though.

I don't understand why the census would include SNAP data or income: surely the government already has that information. I have never doubted that the IRS knows my income better than I do. Maybe better use of existing datasets could restrict the census to less invasive questions.

reply
They haven't stopped but they don't happen immediately.

Detailed census records are published 72 years after they were collected; the last release (of 1950 census data) came out in 2022; the next one should be published in 2032.

See: https://www.archives.gov/research/census

reply
They didn't stop publishing census data. Its publication is delayed for approximately one human lifetime, to avoid affecting the living:

https://prologue.blogs.archives.gov/2022/01/20/census-record...

reply
The Census Bureau is a lot more than the 10-year Census, and it already makes very extensive use of IRS data and other administrative sources. Virtually everything that is published using these sources uses either differential privacy or other privacy protection methods that are prohibited by the order. I'm guessing that a lot of those pieces of data are just going to be put on hold until the order is reversed or weakened. A number of things might have to go away permanently, as there's almost certainly no way to protect privacy in them without some kind of noise infusion.

TBH I don't think the people who wrote this knew how much collateral impact it would have.

reply
Thank you for writing a much more thoughtful reply to this comment than I was drafting
reply
Replying to the ACS with accurate information is required by law, so they don't actually need to rely on people feeling safe to get answers.

I don't trust the Census Bureau with my data, so if this is as "dangerous" as the author and some people here seem to think, they shouldn't be collecting it in the first place.

reply
> Replying to the ACS with accurate information is required by law, so they don't actually need to rely on people feeling safe to get answers.

This works by the same principle as how nobody ever drives faster than the speed limit.

reply
I don't understand your point here. Are you saying compliance isn't enforced?

As someone who got an ACS survey not long ago and had no interest in completing it, it certainly appears to be.

reply
There's not many cases of enforcement. Non-response is taken about as seriously as the Robinson–Patman act. I think the Census Bureau is very reliant on people thinking there will be enforcement, however, which is why the materials they send all have a threatening aura. I don't know about the ACS, but for the decennial census I often felt like my job as an enumerator was just to bother people until they'd answer. The case would keep being recycled until we got at least (IIRC) a head count.
reply
They can certainly enforce that you answer the survey. But it's very difficult to enforce a requirement that people answer questions accurately, particularly when they perceive that doing so will expose them to danger.
reply
I don't get what danger is being referenced here that exists only if the data is released to the public (in aggregate)?

The government is the primary and arguably only source of the danger, and they already have most of the data whether you answer the ACS correctly or not.

reply
[flagged]
reply
[flagged]
reply
Yet you have no retort
reply
[flagged]
reply
1. People give the information to the government under the expectation that this data is to be kept private or used in such a way that individual targeting is made impossible, you break that expectation and people will lie or won't give you this data.

2. Without noise injection it's rather simple to do statistical attacks to reverse engineer individual entities.

3. This data is and has already been used in the past to undermine democratic systems by targeting and disenfranchising minorities, as well as gerrymandering the US to hell.

4. "Too dangerous to make public, too dangerous to collect" - this is a false dichotomy. To govern effectively you need sensitive data, but it should be collected and used in a way that's safe for the individuals.

5. Macro level aggregates don't need individual exposure, that's why noise, anonymization and statistical functions are fine.

reply
Re point 1, not just an expectation, and explicit legal requirement.
reply
>If it's to dangerous to make public, it's too dangerous to collect, and people should be aware of exactly what it is.

While this may be a reasonable stance in theory, there are many examples in reality where the danger has not materialized for decades. Personally, I have access to health records, birth certificates, and death certificates collected by a state. They contain very personal information. As far as I know, they have not been leaked to the general public.

This is one of those situations where everything you hear tells you the system is failing, but that's because nobody talks about the systems which haven't failed.

Besides, this possible failing of the Census' privacy promises shouldn't convince us that "If only we hadn't given info to the despotic and cruel government using it to target people, then we'd only have a despotic and cruel government hurting people randomly." The solution to this problem isn't to withhold info, it's to get rid of the despots.

reply
> They should simply publish a full dataset of the census, with no such data coarsening/differential privacy/ etc...

They do. After a substantial delay. Pretty handy for geneological research, while protecting privacy for the living.

reply
That's a good default position, and I think should be our starting point.

But the devil is in the details. If we don't want advertisers constructing semi-complete profiles from simple web interactions then why would we publish 330 million census questionnaires for their use?

reply
So do you believe that individual income should be public? Or do you believe that the government should not take income into account for taxation or distribution of benefits?
reply
Then dox yourself right now with your previous census answers and PII. There are several obvious reasons to keep the data private, all you have to do is use your brain.
reply
I've never met a "privacy is irrelevant" advocate that doesn't close the door when they go to the toilet
reply
Don’t quit your day job. One guess as to what gender, sexual orientation, and skin colour you have.
reply
But why is the census asking about those attrbutes at all. The Constitution requires a count. That's it. A number. We don't need to know the rest of it, or if we do, it should be surveyed separately with voluntary participation.
reply
> We don't need to know the rest of it, or if we do, it should be surveyed separately with voluntary participation.

But we do. A detailed census is essential for making good policy. For example, knowing the age and distribution of children across the country helps local and state governments decide where to put the next school or children's hospital. The federal govt. allocates funds for education and daycare accordingly.

The census is the best and most important measure of govt. policy. Taking it away would leave everyone worse off.

reply
The risks of abuse are too high and historically proven to happen eventually. There are many other ways to determine where schools and hospitals are needed, such as aggregate enrollment and admission statistics.
reply
>There are many other ways to determine where schools and hospitals are needed, such as aggregate enrollment and admission statistics.

You do realize there are places where there aren't schools or hospitals?

reply
Local school districts know where they need more or fewer schools. This sort of thing isn't any business or responsibilty of the federal government at all.
reply
The census is already voluntary LOL. So we’d have two censuses?
reply
Census participation is not voluntary. Failure to provide complete or accurate data is, in theory, punishable by a fine. Last census, I intentionally provided incomplete data on the web form, which resulted in a person with a clipboard and some stern questions showing up at my door.
reply
[flagged]
reply
[flagged]
reply