Overview

The Kidney Tissue Atlas Repository is intended to allow our users to search for and download data provided to us by our Tissue Interrogation Sites. In many cases, this data is the raw data, however, this also includes some more processed data as well as a limited set of clinical data. Not all of the data provided in the Atlas Repository is available to everyone, however. There is a set of controlled data that requires users to fill out a Data Use Agreement before they are allowed access.

Repositories

List of the repositories required to run this application.
Repository Description
portal-ui This was originally forked from the GDC Data Portal. We have since made modifications to brand it and make it functional for our use.
knowledge-environment (private repository) Contains schema for the data stores used in the Repository and Explorer
atlas-index-creation-worker The index creation worker will create the necessary ES indices needed for the repository from the data stored in the KE database
atlas-file-service Handles file downloads for the repository
heavens-docker This repository holds all of the docker containers and docker-compose scripts for running our applications

Data Stores

MariaDB
We use a MariaDB instance to store the data required to generate the ElasticSearch indices which feed the application. This database also holds information used to feed the Explorer. The database is constantly evolving, and the best place to find information on the structure is in the knowledge-environment repository.
ElasticSearch
The ElasticSearch indexes to feed the application are evolving quickly. The best place to find information on the structure is in the knowledge-environment repository.
AWS S3 Bucket
The files available for download from the Atlas Repository are stored in a private S3 bucket.