{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# How many fact sheets survived the NAA website migration in 2019" ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "import pandas as pd\n", "import requests\n", "from bs4 import BeautifulSoup" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Get the most recent version of the fact sheet index from the Internet Archive\n", "\n", "First we'll load the page." ] }, { "cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [], "source": [ "# Note the 'id_' in the url to get the original page without the IA navigation.\n", "response = requests.get(\n", " \"https://web.archive.org/web/20190716210347id_/http://www.naa.gov.au/collection/fact-sheets/by-number/index.aspx\"\n", ")" ] }, { "cell_type": "code", "execution_count": 3, "metadata": {}, "outputs": [], "source": [ "soup = BeautifulSoup(response.content)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Then we'll extract the rows from the index table." ] }, { "cell_type": "code", "execution_count": 4, "metadata": {}, "outputs": [], "source": [ "fs_list = soup.find(\"table\", title=\"Numerical list of fact sheets\").find_all(\"tr\")[1:]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Look for the fact sheets\n", "\n", "Let's loop through all the rows in the fact sheet index, extracting the fact sheet number, title and url. Then we'll try loading the url. We'll save all the details and the HTTP status code for further exploration." ] }, { "cell_type": "code", "execution_count": 5, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Reading room addresses and hours of opening: 200\n", "Using our collection: 404\n", "Addresses of Australian archival institutions: 404\n", "Reading room rules: 404\n", "What are archives?: 404\n", "Archival terms: 200\n", "The Commonwealth Record Series (CRS) system: 200\n", "Citing archival records: 200\n", "Copyright: 200\n", "Searching for records: 404\n", "Access to records under the Archives Act: 200\n", "Viewing records in the reading room: 404\n", "What to do if we refuse you access: 200\n", "RecordSearch: an overview: 404\n", "Keyword searching in RecordSearch Advanced search screens: 404\n", "Release of records containing personal information: 200\n", "Service guidelines for the National Reference Service: 404\n", "NameSearch: 200\n", "PhotoSearch: 404\n", "Parliamentary Papers: 404\n", "Commonwealth of Australia Gazettes: 200\n", "Customs House, Sydney: 200\n", "Coastal fortifications in New South Wales: 404\n", "Commonwealth Film Unit: 404\n", "The wine industry in South Australia: 404\n", "Tasmanian railways: 404\n", "Australia First Movement: 404\n", "Commonwealth banking policy: 404\n", "Navy service records: 404\n", "Navy crew and ships records: 404\n", "RAAF service records: 404\n", "Security intelligence records held in Canberra: 200\n", "Cabinet records: 404\n", "Administration of the Australian Capital Territory: 404\n", "Military records held in Hobart: 404\n", "Maritime records held in Hobart: 200\n", "Passenger records held in Canberra: 200\n", "Civilian service in World War II: 404\n", "Research agents – Canberra: 404\n", "Research agents – Sydney: 404\n", "Research agents – Brisbane: 404\n", "Research agents – Adelaide and Darwin: 404\n", "Research agents – Melbourne and Hobart: 404\n", "Research agents – Perth: 404\n", "Why we refuse access: 200\n", "Australian Overseas Information Service photographs: 404\n", "Papua New Guinea patrol reports: 404\n", "D Notices: 200\n", "Post Office records: 200\n", "Copying charges: 200\n", "Exempt information in ASIO records: 404\n", "Personal information in ASIO records: 404\n", "Veterans' case files: 200\n", "Fremantle Harbour: 404\n", "Passenger records held in Perth: 200\n", "Melbourne Olympics, 1956: 404\n", "World War I internee, alien and POW records held in Canberra: 404\n", "World War II internee, alien and POW records held in Canberra: 404\n", "Design and development of the national capital: 404\n", "World War II war crimes: 404\n", "Indonesian independence: 404\n", "War service information: 404\n", "Passenger records held in Sydney: 200\n", "Customs shipping records held in Sydney: 404\n", "Migrant selection documents held in Canberra: 404\n", "Boer War records: 404\n", "Naturalisation records held in Canberra: 200\n", "ASIO files on writers and literary groups: 404\n", "Prime ministers of Australia: 404\n", "Prime Minister Joseph Cook: 404\n", "Prime Minister William Morris Hughes: 404\n", "Prime Minister Stanley Melbourne Bruce: 404\n", "Prime Minister James Henry Scullin: 404\n", "Prime Minister Joseph Aloysius Lyons: 404\n", "Prime Minister Earle Christmas Grafton Page: 404\n", "Prime Minister Robert Gordon Menzies: 404\n", "Prime Minister Arthur William Fadden: 404\n", "Prime Minister John Joseph Ambrose Curtin: 404\n", "Prime Minister Francis Michael Forde: 404\n", "Prime Minister Joseph Benedict Chifley: 404\n", "Prime Minister Harold Edward Holt: 404\n", "Prime Minister John McEwen: 404\n", "Prime Minister John Grey Gorton: 404\n", "Family history sources held in Canberra: 404\n", "Family history sources held in Adelaide: 404\n", "Australia and the United Nations: 404\n", "Births, deaths and marriages: 404\n", "Cyclones and the Northern Territory: 404\n", "Coastal fortifications in South Australia: 404\n", "Customs houses in South Australia: 404\n", "Customs House, Port Adelaide, South Australia: 404\n", "Excise control of distilled products in South Australia: 404\n", "Walter Burley Griffin and the design of Canberra: 404\n", "J T Lang and Lang Labor: 200\n", "Regulation of beer and brewing in South Australia: 404\n", "Sir Frederick Shedden and the Shedden collection: 404\n", "Records relating to Italian migration held in Sydney: 404\n", "World War II internee, alien and POW records held in Sydney: 404\n", "The Australian flag: 200\n", "The Cocos (Keeling) Islands: 404\n", "Commonwealth electoral rolls held in Perth: 404\n", "Copyright records: 404\n", "World War I internee, alien and POW records held in Adelaide: 404\n", "World War II internee, alien and POW records held in Adelaide: 404\n", "The Pastoral industry in the Northern Territory: 404\n", "Building the provisional Parliament House: 404\n", "When to use the Freedom of Information, Archives and Privacy Acts: 200\n", "The sinking of HMAS Sydney, November 1941: 200\n", "Royal Commission into Aboriginal Deaths in Custody: 200\n", "Aboriginal and Torres Strait Islander people: 404\n", "Memorandum of Understanding with Northern Territory Aboriginal people: 404\n", "Introducing television to Australia, 1956: 404\n", "Guides to the collection: 404\n", "Australia's involvement in the Vietnam War: 200\n", "Computer resources in reading rooms: 404\n", "Commonwealth electoral rolls held in Brisbane: 404\n", "Bankruptcy records held in Sydney: 404\n", "General Sir John Monash: 404\n", "Lighthouse records held in Hobart: 404\n", "Records of British migrants held in Canberra: 404\n", "Child migration to Australia: 200\n", "Radar research in Australia during World War II: 404\n", "Radar production and use during World War II: 404\n", "War Cabinet records: 404\n", "Cabinet notebooks: 404\n", "British nuclear tests at Maralinga: 404\n", "The Royal Commission on Espionage, 1954–55: 404\n", "Posters: 200\n", "World War ll Army pay files held in Adelaide: 404\n", "Defence and service records held in Melbourne: 404\n", "Colonial defence personnel records held in Melbourne: 404\n", "Army administrative records held in Melbourne: 404\n", "Army service records: 404\n", "Navy administrative records held in Melbourne: 404\n", "Navy service records held in Melbourne: 404\n", "Royalty and Australian society: 404\n", "Cockatoo Island Dockyard: 404\n", "Stevedoring industry: 404\n", "Canberra air disaster, 1940: 404\n", "North Head Quarantine Station, Sydney: 404\n", "Harold Holt's disappearance, 1967: 200\n", "Albert Namatjira: 404\n", "Jessie Sinclair Litchfield: 404\n", "Child migrant records held in Sydney: 404\n", "Records of Papua New Guinea, 1883–1942: 200\n", "Sound collections held in Sydney: 404\n", "The 1967 referendum: 200\n", "Bishop Francis Xavier Gsell: 404\n", "Army and RAAF pay records held in Perth: 404\n", "ABC Talks Department scripts: 404\n", "External Affairs cables: 404\n", "Edward John Connellan and Connellan Airways: 403\n", "Records of Dutch migration held in Sydney: 404\n", "Christmas Island: 404\n", "Foundation of the State of Israel, 1946: 404\n", "Reverend John Flynn and the Australian Inland Mission: 404\n", "Universal military training in Australia, 1911–29: 200\n", "Conscription referendums, 1916 and 1917: 200\n", "National Service and war, 1939–45: 200\n", "National Service, 1951–59: 404\n", "National Service, 1965–72: 200\n", "Royal Military College, Duntroon: 404\n", "Government House, Canberra: 404\n", "Mount Stromlo Observatory: 404\n", "Gorman House, Canberra: 404\n", "Migrant hostels in New South Wales, 1946–78: 200\n", "World War I internee, alien and POW records held in Sydney: 404\n", "Passenger records held in Melbourne: 200\n", "Security intelligence records in Melbourne: 404\n", "East Block building, Canberra: 404\n", "Bringing Them Home name index: 200\n", "Cyclone Tracy, Darwin: 200\n", "World War I and World War II service records: 200\n", "Commonwealth Reconstruction Training Scheme (CRTS) administrative records: 404\n", "Commonwealth Reconstruction Training Scheme (CRTS) applicants and trainees: 404\n", "Wartime internee, alien and POW records held in Perth: 200\n", "Civil Constructional Corps records held in Perth: 404\n", "Civil Alien Corps records held in Perth: 404\n", "New Guard Movement, 1931–35: 404\n", "Passenger records held in Hobart: 404\n", "Migrant selection documents held in Perth: 404\n", "Alien registration records held in Perth: 404\n", "Citizenship in Australia: 200\n", "Empire Games, Sydney, 1938: 403\n", "General Post Office, Sydney: 200\n", "Passenger records held in Brisbane: 404\n", "Aerial photographs: 404\n", "Japanese midget submarine attacks on Sydney, 1942: 200\n", "Addresses of other national archives: 404\n", "Australian Antarctic exploration and research: 404\n", "The bombing of Darwin: 200\n", "Sir Charles Kingsford Smith: 404\n", "Photographs relating to Sir Charles Kingsford Smith: 404\n", "Cowra breakout, 1944: 200\n", "Army Inventions Directorate, 1942–46: 404\n", "Beginning your family history research: 404\n", "Tracing ancestors in the National Archives: 404\n", "Tracing ancestors beyond the National Archives: 404\n", "The House of Representatives Standing Committee of Privileges: 404\n", "The Browne-Fitzpatrick privilege case, 1955: 404\n", "Memorandum of Understanding with the Victorian Aboriginal Child Care Agency: 404\n", "Records relating to Italian migration held in Perth: 404\n", "Research Agents – overseas institutions: 404\n", "Memorandum of Understanding with South Australian Indigenous people: 404\n", "Prime Minister Edmund Barton: 404\n", "Prime Minister Alfred Deakin: 404\n", "Prime Minister John Christian Watson: 404\n", "Prime Minister George Houstoun Reid: 404\n", "Prime Minister Andrew Fisher: 404\n", "Prime Minister William McMahon: 404\n", "Prime Minister Edward Gough Whitlam: 404\n", "The Jewish experience in Australia: 404\n", "The National Archives collecting policy: 404\n", "Special access: 404\n", "Passenger arrivals index: 200\n", "High Court of Australia: 404\n", "Mildenhall photographic collection: 404\n", "Migrant selection documents in Adelaide: 200\n", "The Wave Hill walk-off: 200\n", "Charles Nelson Perkins: 404\n", "Custom House, Brisbane: 404\n", "Immigration records: 404\n", "Torrens Island Quarantine Station, South Australia: 200\n", "Access to damaged, fragile or contaminated records: 404\n", "Using cameras in the reading room: 404\n", "Neville Bonner: 404\n", "Industrial relations records held in Melbourne: 404\n", "Lighthouse records held in Brisbane: 404\n", "United States forces in Queensland, 1941–45: 200\n", "Francis Edgar Williams, anthropologist of Papua: 404\n", "Records relating to Italian migration held in Brisbane: 404\n", "International Women's Year, 1975: 404\n", "The 'Balibo affair', East Timor, October 1975: 404\n", "The loans affair, 1974–75: 404\n", "The dismissal, 1975: 404\n", "John Robert Kerr, Governor-General of Australia, 1974–77: 404\n", "Prime Minister John Malcolm Fraser: 404\n", "The fall of Saigon, 1975: 404\n", "Industrial development in Australia after World War II: 404\n", "Patent, trademark and design records in Brisbane: 404\n", "Cabinet records of the Fraser government, 1975–83: 404\n", "Australia's diplomatic relations with China: 403\n", "Daniel Mannix, Catholic Archbishop of Melbourne: 200\n", "The National Archives digitisation service: 404\n", "Albert Hall, Canberra: 404\n", "Australia’s national anthem: 200\n", "Tobacco advertising ban in Australia: 404\n", "Australian Atomic Energy Commission: 404\n", "The Immigration Photographic Archive: 404\n", "Australia and the issue of apartheid in sport: 404\n", "Passenger records held in Adelaide: 200\n", "Official access: 404\n", "Torres Strait Treaty, 1978: 404\n", "South Australian lighthouse records: 404\n", "South Australian maritime records: 404\n", "Independence of Papua New Guinea, 1975: 404\n", "Royal Commission on Intelligence and Security: 404\n", "Independence of Zimbabwe: 404\n", "Finding records relating to an Indigenous person: 404\n", "Patent records held in Canberra: 404\n", "The sinking of the Montevideo Maru: 404\n", "Robert James Lee Hawke: 404\n", "Aboriginal petitions: 404\n", "South Sea Islanders: 404\n", "Indigenous family history beyond the National Archives: 404\n", "Downer family collection: 404\n" ] } ], "source": [ "fact_sheets = []\n", "for row in fs_list:\n", " num = row.td.text\n", " fs = row.find(\"a\")\n", " title = fs.text\n", " url = f'http://naa.gov.au{fs[\"href\"]}'\n", " response = requests.get(url)\n", " status = response.status_code\n", " print(f\"{title}: {status}\")\n", " fact_sheets.append({\"number\": num, \"title\": title, \"url\": url, \"status\": status})" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Examine the results" ] }, { "cell_type": "code", "execution_count": 7, "metadata": {}, "outputs": [], "source": [ "df = pd.DataFrame(fact_sheets)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Let's break down the results by HTTP status code." ] }, { "cell_type": "code", "execution_count": 8, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "404 207\n", "200 56\n", "403 3\n", "Name: status, dtype: int64" ] }, "execution_count": 8, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df[\"status\"].value_counts()" ] }, { "cell_type": "code", "execution_count": 11, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "78.95% of fact sheets are kaput!\n" ] } ], "source": [ "print(f\"{(207 + 3) / (207 + 56 + 3):.2%} of fact sheets are kaput!\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Which fact sheets have survived?" ] }, { "cell_type": "code", "execution_count": 10, "metadata": {}, "outputs": [ { "data": { "text/html": [ "
\n", "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
numbertitleurlstatus
01Reading room addresses and hours of openinghttp://naa.gov.au/collection/fact-sheets/fs01....200
55Archival termshttp://naa.gov.au/collection/fact-sheets/fs05....200
66The Commonwealth Record Series (CRS) systemhttp://naa.gov.au/collection/fact-sheets/fs06....200
77Citing archival recordshttp://naa.gov.au/collection/fact-sheets/fs07....200
88Copyrighthttp://naa.gov.au/collection/fact-sheets/fs08....200
1010Access to records under the Archives Acthttp://naa.gov.au/collection/fact-sheets/fs10....200
1212What to do if we refuse you accesshttp://naa.gov.au/collection/fact-sheets/fs12....200
1515Release of records containing personal informa...http://naa.gov.au/collection/fact-sheets/fs15....200
1718NameSearchhttp://naa.gov.au/collection/fact-sheets/fs18....200
2022Commonwealth of Australia Gazetteshttp://naa.gov.au/collection/fact-sheets/fs22....200
2123Customs House, Sydneyhttp://naa.gov.au/collection/fact-sheets/fs23....200
3133Security intelligence records held in Canberrahttp://naa.gov.au/collection/fact-sheets/fs33....200
3537Maritime records held in Hobarthttp://naa.gov.au/collection/fact-sheets/fs37....200
3638Passenger records held in Canberrahttp://naa.gov.au/collection/fact-sheets/fs38....200
4446Why we refuse accesshttp://naa.gov.au/collection/fact-sheets/fs46....200
4749D Noticeshttp://naa.gov.au/collection/fact-sheets/fs49....200
4850Post Office recordshttp://naa.gov.au/collection/fact-sheets/fs50....200
4951Copying chargeshttp://naa.gov.au/collection/fact-sheets/fs51....200
5254Veterans' case fileshttp://naa.gov.au/collection/fact-sheets/fs54....200
5456Passenger records held in Perthhttp://naa.gov.au/collection/fact-sheets/fs56....200
6264Passenger records held in Sydneyhttp://naa.gov.au/collection/fact-sheets/fs64....200
6668Naturalisation records held in Canberrahttp://naa.gov.au/collection/fact-sheets/fs68....200
9396J T Lang and Lang Laborhttp://naa.gov.au/collection/fact-sheets/fs96....200
98102The Australian flaghttp://naa.gov.au/collection/fact-sheets/fs102...200
106110When to use the Freedom of Information, Archiv...http://naa.gov.au/collection/fact-sheets/fs110...200
107111The sinking of HMAS Sydney, November 1941http://naa.gov.au/collection/fact-sheets/fs111...200
108112Royal Commission into Aboriginal Deaths in Cus...http://naa.gov.au/collection/fact-sheets/fs112...200
113117Australia's involvement in the Vietnam Warhttp://naa.gov.au/collection/fact-sheets/fs117...200
120124Child migration to Australiahttp://naa.gov.au/collection/fact-sheets/fs124...200
127131Postershttp://naa.gov.au/collection/fact-sheets/fs131...200
140144Harold Holt's disappearance, 1967http://naa.gov.au/collection/fact-sheets/fs144...200
144148Records of Papua New Guinea, 1883–1942http://naa.gov.au/collection/fact-sheets/fs148...200
146150The 1967 referendumhttp://naa.gov.au/collection/fact-sheets/fs150...200
156160Universal military training in Australia, 1911–29http://naa.gov.au/collection/fact-sheets/fs160...200
157161Conscription referendums, 1916 and 1917http://naa.gov.au/collection/fact-sheets/fs161...200
158162National Service and war, 1939–45http://naa.gov.au/collection/fact-sheets/fs162...200
160164National Service, 1965–72http://naa.gov.au/collection/fact-sheets/fs164...200
165170Migrant hostels in New South Wales, 1946–78http://naa.gov.au/collection/fact-sheets/fs170...200
167172Passenger records held in Melbournehttp://naa.gov.au/collection/fact-sheets/fs172...200
170175Bringing Them Home name indexhttp://naa.gov.au/collection/fact-sheets/fs175...200
171176Cyclone Tracy, Darwinhttp://naa.gov.au/collection/fact-sheets/fs176...200
172177World War I and World War II service recordshttp://naa.gov.au/collection/fact-sheets/fs177...200
175180Wartime internee, alien and POW records held i...http://naa.gov.au/collection/fact-sheets/fs180...200
182187Citizenship in Australiahttp://naa.gov.au/collection/fact-sheets/fs187...200
184189General Post Office, Sydneyhttp://naa.gov.au/collection/fact-sheets/fs189...200
187192Japanese midget submarine attacks on Sydney, 1942http://naa.gov.au/collection/fact-sheets/fs192...200
190195The bombing of Darwinhttp://naa.gov.au/collection/fact-sheets/fs195...200
193198Cowra breakout, 1944http://naa.gov.au/collection/fact-sheets/fs198...200
214220Passenger arrivals indexhttp://naa.gov.au/collection/fact-sheets/fs220...200
217223Migrant selection documents in Adelaidehttp://naa.gov.au/collection/fact-sheets/fs223...200
218224The Wave Hill walk-offhttp://naa.gov.au/collection/fact-sheets/fs224...200
222228Torrens Island Quarantine Station, South Austr...http://naa.gov.au/collection/fact-sheets/fs228...200
228234United States forces in Queensland, 1941–45http://naa.gov.au/collection/fact-sheets/fs234...200
242248Daniel Mannix, Catholic Archbishop of Melbournehttp://naa.gov.au/collection/fact-sheets/fs248...200
245251Australia’s national anthemhttp://naa.gov.au/collection/fact-sheets/fs251...200
250256Passenger records held in Adelaidehttp://naa.gov.au/collection/fact-sheets/fs256...200
\n", "
" ], "text/plain": [ " number title \\\n", "0 1 Reading room addresses and hours of opening \n", "5 5 Archival terms \n", "6 6 The Commonwealth Record Series (CRS) system \n", "7 7 Citing archival records \n", "8 8 Copyright \n", "10 10 Access to records under the Archives Act \n", "12 12 What to do if we refuse you access \n", "15 15 Release of records containing personal informa... \n", "17 18 NameSearch \n", "20 22 Commonwealth of Australia Gazettes \n", "21 23 Customs House, Sydney \n", "31 33 Security intelligence records held in Canberra \n", "35 37 Maritime records held in Hobart \n", "36 38 Passenger records held in Canberra \n", "44 46 Why we refuse access \n", "47 49 D Notices \n", "48 50 Post Office records \n", "49 51 Copying charges \n", "52 54 Veterans' case files \n", "54 56 Passenger records held in Perth \n", "62 64 Passenger records held in Sydney \n", "66 68 Naturalisation records held in Canberra \n", "93 96 J T Lang and Lang Labor \n", "98 102 The Australian flag \n", "106 110 When to use the Freedom of Information, Archiv... \n", "107 111 The sinking of HMAS Sydney, November 1941 \n", "108 112 Royal Commission into Aboriginal Deaths in Cus... \n", "113 117 Australia's involvement in the Vietnam War \n", "120 124 Child migration to Australia \n", "127 131 Posters \n", "140 144 Harold Holt's disappearance, 1967 \n", "144 148 Records of Papua New Guinea, 1883–1942 \n", "146 150 The 1967 referendum \n", "156 160 Universal military training in Australia, 1911–29 \n", "157 161 Conscription referendums, 1916 and 1917 \n", "158 162 National Service and war, 1939–45 \n", "160 164 National Service, 1965–72 \n", "165 170 Migrant hostels in New South Wales, 1946–78 \n", "167 172 Passenger records held in Melbourne \n", "170 175 Bringing Them Home name index \n", "171 176 Cyclone Tracy, Darwin \n", "172 177 World War I and World War II service records \n", "175 180 Wartime internee, alien and POW records held i... \n", "182 187 Citizenship in Australia \n", "184 189 General Post Office, Sydney \n", "187 192 Japanese midget submarine attacks on Sydney, 1942 \n", "190 195 The bombing of Darwin \n", "193 198 Cowra breakout, 1944 \n", "214 220 Passenger arrivals index \n", "217 223 Migrant selection documents in Adelaide \n", "218 224 The Wave Hill walk-off \n", "222 228 Torrens Island Quarantine Station, South Austr... \n", "228 234 United States forces in Queensland, 1941–45 \n", "242 248 Daniel Mannix, Catholic Archbishop of Melbourne \n", "245 251 Australia’s national anthem \n", "250 256 Passenger records held in Adelaide \n", "\n", " url status \n", "0 http://naa.gov.au/collection/fact-sheets/fs01.... 200 \n", "5 http://naa.gov.au/collection/fact-sheets/fs05.... 200 \n", "6 http://naa.gov.au/collection/fact-sheets/fs06.... 200 \n", "7 http://naa.gov.au/collection/fact-sheets/fs07.... 200 \n", "8 http://naa.gov.au/collection/fact-sheets/fs08.... 200 \n", "10 http://naa.gov.au/collection/fact-sheets/fs10.... 200 \n", "12 http://naa.gov.au/collection/fact-sheets/fs12.... 200 \n", "15 http://naa.gov.au/collection/fact-sheets/fs15.... 200 \n", "17 http://naa.gov.au/collection/fact-sheets/fs18.... 200 \n", "20 http://naa.gov.au/collection/fact-sheets/fs22.... 200 \n", "21 http://naa.gov.au/collection/fact-sheets/fs23.... 200 \n", "31 http://naa.gov.au/collection/fact-sheets/fs33.... 200 \n", "35 http://naa.gov.au/collection/fact-sheets/fs37.... 200 \n", "36 http://naa.gov.au/collection/fact-sheets/fs38.... 200 \n", "44 http://naa.gov.au/collection/fact-sheets/fs46.... 200 \n", "47 http://naa.gov.au/collection/fact-sheets/fs49.... 200 \n", "48 http://naa.gov.au/collection/fact-sheets/fs50.... 200 \n", "49 http://naa.gov.au/collection/fact-sheets/fs51.... 200 \n", "52 http://naa.gov.au/collection/fact-sheets/fs54.... 200 \n", "54 http://naa.gov.au/collection/fact-sheets/fs56.... 200 \n", "62 http://naa.gov.au/collection/fact-sheets/fs64.... 200 \n", "66 http://naa.gov.au/collection/fact-sheets/fs68.... 200 \n", "93 http://naa.gov.au/collection/fact-sheets/fs96.... 200 \n", "98 http://naa.gov.au/collection/fact-sheets/fs102... 200 \n", "106 http://naa.gov.au/collection/fact-sheets/fs110... 200 \n", "107 http://naa.gov.au/collection/fact-sheets/fs111... 200 \n", "108 http://naa.gov.au/collection/fact-sheets/fs112... 200 \n", "113 http://naa.gov.au/collection/fact-sheets/fs117... 200 \n", "120 http://naa.gov.au/collection/fact-sheets/fs124... 200 \n", "127 http://naa.gov.au/collection/fact-sheets/fs131... 200 \n", "140 http://naa.gov.au/collection/fact-sheets/fs144... 200 \n", "144 http://naa.gov.au/collection/fact-sheets/fs148... 200 \n", "146 http://naa.gov.au/collection/fact-sheets/fs150... 200 \n", "156 http://naa.gov.au/collection/fact-sheets/fs160... 200 \n", "157 http://naa.gov.au/collection/fact-sheets/fs161... 200 \n", "158 http://naa.gov.au/collection/fact-sheets/fs162... 200 \n", "160 http://naa.gov.au/collection/fact-sheets/fs164... 200 \n", "165 http://naa.gov.au/collection/fact-sheets/fs170... 200 \n", "167 http://naa.gov.au/collection/fact-sheets/fs172... 200 \n", "170 http://naa.gov.au/collection/fact-sheets/fs175... 200 \n", "171 http://naa.gov.au/collection/fact-sheets/fs176... 200 \n", "172 http://naa.gov.au/collection/fact-sheets/fs177... 200 \n", "175 http://naa.gov.au/collection/fact-sheets/fs180... 200 \n", "182 http://naa.gov.au/collection/fact-sheets/fs187... 200 \n", "184 http://naa.gov.au/collection/fact-sheets/fs189... 200 \n", "187 http://naa.gov.au/collection/fact-sheets/fs192... 200 \n", "190 http://naa.gov.au/collection/fact-sheets/fs195... 200 \n", "193 http://naa.gov.au/collection/fact-sheets/fs198... 200 \n", "214 http://naa.gov.au/collection/fact-sheets/fs220... 200 \n", "217 http://naa.gov.au/collection/fact-sheets/fs223... 200 \n", "218 http://naa.gov.au/collection/fact-sheets/fs224... 200 \n", "222 http://naa.gov.au/collection/fact-sheets/fs228... 200 \n", "228 http://naa.gov.au/collection/fact-sheets/fs234... 200 \n", "242 http://naa.gov.au/collection/fact-sheets/fs248... 200 \n", "245 http://naa.gov.au/collection/fact-sheets/fs251... 200 \n", "250 http://naa.gov.au/collection/fact-sheets/fs256... 200 " ] }, "execution_count": 10, "metadata": {}, "output_type": "execute_result" } ], "source": [ "df.loc[df[\"status\"] == 200]" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "## Save the results as a CSV" ] }, { "cell_type": "code", "execution_count": 12, "metadata": { "tags": [ "nbval-skip" ] }, "outputs": [], "source": [ "df.to_csv(\"data/fact_sheets.csv\", index=False)" ] } ], "metadata": { "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.8.12 (default, May 16 2022, 14:53:00) \n[GCC 11.2.0]" }, "vscode": { "interpreter": { "hash": "f54aba2de7a75230217f549a064c6555500d2132634fbcab9606dbfda34a2a1b" } }, "widgets": { "application/vnd.jupyter.widget-state+json": { "state": {}, "version_major": 2, "version_minor": 0 } } }, "nbformat": 4, "nbformat_minor": 4 }