
Data Browser query: multiple dataset query


##Overview

Build a query across multiple datasets at once using a harmonized metadata ontology. Metadata consists of properties, which describe each dataset’s entities, and their values. Entities are particular resources with UUIDs, such as files, cases, samples, and cell lines. Learn more [about metadata for datasets](about-metadata-for-datasets).

This page walks you through building a query across several datasets. Learn more about the [Data Browser features](doc:data-browser-features) and the [parts of a Data Browser query](doc:structure-of-a-data-browser-query).

##Objective

The query is performed across several datasets:
  * [TCGA GRCh38 data](doc:tcga-grch38-data)
  * [Cavatica datasets](doc:datasets-overview) (paediatric cancer data)

The query selects Cases that:
  * are female, and
  * have been analyzed with RNA-Seq, and
  * are diagnosed with Acute Myeloid Leukemia, Medulloblastoma, or Atypical Teratoid Rhabdoid Tumor

##Procedure

###[ 1 ] Choose the datasets to be queried

1. [Access the Data Browser](doc:about-the-data-browser#section-method-1-access-the-data-browser-from-the-top-navigation-bar).
2. Select **TCGA GRCh38** and **Cavatica**.
3. Click **Explore selected**.

Note that selecting **Cavatica** includes all paediatric cancer datasets available through the Platform in your query. To query only a subset of the paediatric cancer datasets, select them individually.

###[ 2 ] Build the query

1. Click the **Case** entity to select patients who match your query parameters.
2. Select **Demographic** from the list of entities connected with the **Case** entity.
3. Click **+Add property** below **Demographic**.
4. Search for **female** and select **Gender: female**.
5. Select **File** from the list of entities connected with the **Case** entity.
6. Click **+Add property** below **File**.
7. Search for **rna-seq** and select **Experimental strategy: RNA-seq**.
8. Select **Investigation** from the list of entities connected with the **File** entity.
9. Click **+Add property** below **Investigation**.
10. Search for **Leukaemia**, **Medulloblastoma**, and **Rhabdoid**, and select **Disease type: Acute Myeloid Leukaemia**, **Medulloblastoma**, and **Atypical Teratoid Rhabdoid Tumor**, respectively.

###[ 3 ] Save the query and import the results to your project

1. Select **Save** from the **Queries** drop-down menu.
2. Name your query and add an optional description.
3. Click **Save query**.
4. [Import the query results to your project](doc:access-data-from-the-data-browser).

Note that you cannot import restricted query results without the right level of access. Files you cannot access are labelled with a red closed lock. This can occur, for example, with TCGA Controlled Data if you don't have permission from dbGaP. Read more about [accessing data from the Data Browser](doc:access-data-from-the-data-browser).

That's it: you've successfully built a query across several datasets!
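
The logic of the query built on this page can also be written out as a short Python sketch. This is only an illustration of how the three conditions combine (gender AND experimental strategy AND one of several disease types). It does not use the Data Browser or any Platform API, and the `Case` class, its field names, and the sample records are all hypothetical.

```python
from dataclasses import dataclass


# Hypothetical, simplified record for illustration only.
# This is NOT the Platform's data model or API.
@dataclass(frozen=True)
class Case:
    case_id: str
    dataset: str
    gender: str
    experimental_strategies: frozenset
    disease_type: str


# A case matches if its disease type is any one of these (logical OR).
DISEASE_TYPES = {
    "Acute Myeloid Leukemia",
    "Medulloblastoma",
    "Atypical Teratoid Rhabdoid Tumor",
}


def matches(case: Case) -> bool:
    """Return True when a case satisfies all three conditions (logical AND)."""
    return (
        case.gender == "female"
        and "RNA-Seq" in case.experimental_strategies
        and case.disease_type in DISEASE_TYPES
    )


def run_query(cases):
    """Return the IDs of the matching cases, in input order."""
    return [case.case_id for case in cases if matches(case)]


# Made-up records standing in for cases from the two selected datasets.
SAMPLE_CASES = [
    Case("case-1", "TCGA GRCh38", "female", frozenset({"RNA-Seq", "WXS"}), "Acute Myeloid Leukemia"),
    Case("case-2", "TCGA GRCh38", "male", frozenset({"RNA-Seq"}), "Acute Myeloid Leukemia"),
    Case("case-3", "Cavatica", "female", frozenset({"WGS"}), "Medulloblastoma"),
    Case("case-4", "Cavatica", "female", frozenset({"RNA-Seq"}), "Atypical Teratoid Rhabdoid Tumor"),
    Case("case-5", "Cavatica", "female", frozenset({"RNA-Seq"}), "Ependymoma"),
]

if __name__ == "__main__":
    print(run_query(SAMPLE_CASES))  # ['case-1', 'case-4']
```

In this sketch, `case-2` is excluded by gender, `case-3` by experimental strategy, and `case-5` by disease type, which mirrors how each property added in the Data Browser narrows the set of matching cases.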