--- layout: default title: Covid Positive parent: Curated Assets nav_order: 3 permalink: /docs/curated_assets/covid_positive --- # Covid Positive View code on GitHub The ***covid_positive*** table is a consolidated table for positive COVID-19 case records derived from antigen testing, primary and secondary care data (including SGSS, Pillar 2 antigen testing, GDPPR, HES APC (diagnosis) and CHESS). Positive COVID-19 records are extracted from each dataset and records with a distinct person ID and non-null dates are retained. The tables from each extraction are then combined to create a unified dataset of all COVID-19 positive records. The output table includes six columns: - **person_id** - **date** - **code** - **description** - **covid_status** (confirmed/suspected) - **data_source** {: .note } The methodology for curating the table is further outlined in this paper. The table is saved to the DSA schema **dsa_391419_j3w9t_collab**. The archived_on_date is in the format **YYYY_MM_DD**. {: .highlight-title } > Table Name > > > > hds_curated_assets__covid_positive_archived_on_date The below code will load the hes_apc_diagnosis table as at October 2024 using PySpark: {% highlight markdown %} ```python import pyspark.sql.functions as f dsa = f'dsa_391419_j3w9t_collab' covid_positive = spark.table(f'{dsa}.hds_curated_assets__covid_positive_2024_10_24') ``` {% endhighlight %}