Home » Posts tagged 'Congressional data'
Tag Archives: Congressional data
Here’s an interesting dataset to bookmark. After a 10-year moratorium on earmarks, the House Appropriations Committee recently released PDF tables of fiscal year 2022 congressionally directed spending projects. But those PDFs aren’t actually usable for any sort of deeper data analysis. So the Congress Project at the Bipartisan Policy Center has just released the Congressionally Directed Spending FY2022 Dataset. It’s a database of all the FY22 “Final Funded Projects” in H.R. 2471, with some additional member data included (the original tables often only include last name). Check out the Congress Project’s blog post for more on how they extracted and cleaned the data. Good work by the Bipartisan Policy Center!
Data is plural newsletter posts 2 amazing govinfo datasets: House Comm witnesses and 1900 census immigrant populations
I love my Data Is Plural newsletter, Jeremy Singer-Vine’s weekly newsletter of useful/curious datasets! You can check out his archive from 2015(!) to present and also explore the archive as a google spreadsheet or as Markdown files (a dataset of interesting datasets :-)).
Today’s edition was especially good on the govinfo front: 2 really awesome datasets on House Committee witnesses (1971 – 2016) and a high-resolution transcription and CSV file of the Census Bureau’s 1900 report on immigrant populations. Check them out, and don’t forget to subscribe to the Data is plural newsletter!
House committee witnesses. Political scientists Lauren C. Bell and J.D. Rackey have compiled a spreadsheet of 435,000+ people testifying before the US House of Representatives from 1971 to 2016. They began with a text file scraped from a ProQuest database, provided by the authors of a dataset that focused on social scientists’ testimony (DIP 2020.12.23). Then, they determined each witness’s first and last name; type of organization; the committee, date, title, and summary of the relevant hearing; and more.
Immigrant populations in 1900. The 1900 US Census’s public report includes a table counting the foreign-born residents of each state and territory — overall and disaggregated into a few dozen origins, which range from subdivisions of countries (Poland is split into “Austrian,” “German,” “Russian,” and “unknown” columns) to entire continents (“Africa”). It’s officially available as a low-resolution PDF. Reporters at Stacker, however, recently transcribed it into a CSV file for easier use.
How a complex network of bills becomes a law: GovTrack introduces new data analysis of text incorporation
Here’s a fascinating new way to look at US Congressional legislation from our friends at GovTrack.us. As Josh Tauberer explains, GovTrack’s new service “Enacted Via Other Measures,” their new data analysis of text incorporation, will now provide connections between bills — when a bill has at least about 33% of its provisions incorporated into one or more enacted bills — in order to show “how a complex network of bills becomes a law.”
No longer will legislative trackers be limited to the 6 stages in becoming a law described on Congress.gov, or even the 13 steps described by this handy infographic by Mike Wirth and Suzanne Cooper-Guasco (“How Our Laws Are Made”, First place award in the Design for America contest, 2010). Now we’ll be able to see the various pieces of bills that make it into other bills.
This is an amazing new looking glass into the legislative process. Thanks GovTrack.us!
This new analysis literally doubles our insight.
Only about 3% of bills will be enacted through the signature of the President or a veto override. Another 1% are identical to those bills, so-called “companion bills,” which are easily identified (see CRS, below). Our new analysis reveals almost another 3% of bills which had substantial parts incorporated into an enacted bill in 2015–2016. To miss that last 3% is to be practically 100% wrong about how many bills are being enacted by Congress.
And there may be even more than that, which we’ll find out as we tweak our methodology in the future.
There are so many new questions to answer:
- Who are the sources of these enacted provisions?
- How often is this cut-and-paste process cross-partisan?
- What provisions were removed from a bill to be enacted?
- Is cut-and-paste more frequent today than in the past?
Sign up now for this year’s Legislative Data and Transparency Conference (#LDTC16) held in Washington DC. In years past, they’ve streamed the proceedings, so definitely sign up for free if you’re interested in open legislative data, even if you’re not in the DC area!
The 2016 Legislative Data and Transparency Conference (#LDTC16), hosted by the Committee on House Administration, will take place on Tuesday, June 21, 2016, from 9 a.m. to 4 p.m. in the Capitol Visitor Center Congressional Auditorium.
The #LDTC16 brings individuals from Legislative Branch agencies together with data users and transparency advocates to foster a conversation about the use of legislative data – addressing how agencies use technology well and how they can use it better in the future.
Tuesday, June 21, 2016 from 9:00 AM to 4:00 PM (EDT) – Add to Calendar
Capitol Visitor Center Congressional Auditorium
I’ve had a tab open to this ProPublica post “A New Way to Keep an Eye on Who Represents You in Congress” for a couple of weeks and just now getting around to sharing. Their new project called “Represent” is a great way to track on lawmakers, the bills they consider and the votes they take (and miss). Search for your legislators by address, ZIP code or name. A very handy tool indeed. But 2 things stand out especially about this new effort: 1) “Represent” not only collates data from a variety of government resources (see below) but they also point out to other sites that offer valuable features like individual lawmaker and bill pages on GovTrack and C-SPAN; and 2) They’re making available all the data that they use through their API. Their data sources include:
- The official Web site of the Office of the Clerk of the U.S. House of Representatives, for vote data
- The official Web site of the United States Senate, for vote data
- The Biographical Directory of the United States Congress, for member biographical information
- The United States Project, for social media account names in member lists and some member biographical information
- MIT Professor Charles Stewart’s collection of Congressional data, for some role information
- Congress.gov (The Library of Congress) and the Government Publishing Office, for bill data and nomination data
Check it out, bookmark it, and let your library patrons know about it!
Today ProPublica is launching a new interactive database that will help you keep track of the officials who represent you in Congress.
The project is the continuation of two projects I worked on at The New York Times — the first is the Inside Congress database, which we are taking over at ProPublica starting today.
But we also have big plans for it. While the original interactive database at The Times focused on bills and votes, our new project adds pages for each elected official, where you can find their latest votes, legislation they support and statistics about their voting. As we move forward we want to add much more data to help you understand how your elected officials represent you, the incentives that drive them and the issues they care about.
In that way, it is also a continuation of another project I worked on at the Times. In late 2008, The New York Times launched an app called Represent that connected city residents with the officials who represented them at the local, state and federal levels. It was an experiment in trying to make it easier to keep track of what elected officials were doing.
Because ProPublica is rekindling that effort, we’re calling the new project Represent.
The new Represent will help you track members, votes and bills in the House of Representatives and Senate. We’re also launching a Congress API, or Application Programming Interface, so developers can get data about what Congress is doing, too.