Channel: Latest Questions by aRookieBIdev

Database Design Help

I am stuck on a database design problem: I am trying to understand how best to design a database for some archiving I want to do. I am not an expert in this field and could use some ideas or suggestions. I will define my specifications as best I can. Let me know if you need more details.

To give you some background: we design primer libraries. Each library is simply a set of regions we are focusing on. For example, library 1 can have 5 regions: chromosome 1, position 1 to 10; chromosome 2, position 1 to 500; and so on. There is no limit to the number of regions a single library can include; we typically see about 1000-3000 regions in one library.

Next we run samples using these libraries. This means we take a sample and look at each region defined in the library of interest (to put it crudely). What I want to collect is performance metrics on how well a library performed across multiple samples. We have defined performance metrics which we can collect for each region within each primer library, for as many samples as were run.

What is the best way to store this information in a table / database structure? The simplest one I can think of is a single table for all libraries with the columns below. Each sample would go in as a bunch of rows (one per region).

SampleName AnalysisID LibraryName RegionID Region metric being tracked

However, the main objective of creating this database is to make it easy to look at "poorly" performing regions across different samples and determine consistency. For this, it would help if we had only one row per region of a particular library, with each sample as a new column (somewhat like an Excel table). But different samples get run for each library, so if sample names were columns, we would have a large number of columns growing quickly and a lot of empty cells where sample 1 was run for library 1 but not for library 2.

So does that mean I create a different table for each library? Maybe, but it just doesn't sound right. Is there a better solution?

Thanks for reading. Your expertise will be much appreciated.
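
To make the single-table idea concrete, here is a minimal sketch of it, not a recommended final design. It uses Python's built-in sqlite3 module with an in-memory database. The table name `region_metrics`, the snake_case column names, the reading of "Region" and the metric as two separate columns, the made-up rows, and the rule that a metric below 0.5 counts as "poor" are all assumptions for illustration. The sketch suggests that one long table can answer the consistency question with a GROUP BY, and that the Excel-like view (one row per region, one column per sample) can be built at query time for a single library instead of being stored.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE region_metrics (
        sample_name  TEXT NOT NULL,
        analysis_id  TEXT NOT NULL,
        library_name TEXT NOT NULL,
        region_id    TEXT NOT NULL,
        region       TEXT NOT NULL,  -- assumed format, e.g. 'chr1:1-10'
        metric       REAL NOT NULL,  -- the metric being tracked
        PRIMARY KEY (sample_name, analysis_id, library_name, region_id)
    )
""")

# Hypothetical data: two samples run on lib1, one sample run on lib2.
rows = [
    ("S1", "A1", "lib1", "r1", "chr1:1-10", 0.9),
    ("S1", "A1", "lib1", "r2", "chr2:1-500", 0.2),
    ("S2", "A2", "lib1", "r1", "chr1:1-10", 0.8),
    ("S2", "A2", "lib1", "r2", "chr2:1-500", 0.3),
    ("S3", "A3", "lib2", "r1", "chr1:1-10", 0.4),
]
conn.executemany("INSERT INTO region_metrics VALUES (?, ?, ?, ?, ?, ?)", rows)


def poor_region_summary(conn, threshold):
    """Per library and region: how many samples ran, and how many were below threshold."""
    query = """
        SELECT library_name, region_id,
               COUNT(*)          AS n_samples,
               SUM(metric < ?)   AS n_poor   -- comparison yields 0 or 1 in SQLite
        FROM region_metrics
        GROUP BY library_name, region_id
        ORDER BY library_name, region_id
    """
    return conn.execute(query, (threshold,)).fetchall()


def wide_view(conn, library):
    """Build the spreadsheet-like view for one library: region -> {sample: metric}."""
    query = """
        SELECT region_id, sample_name, metric
        FROM region_metrics
        WHERE library_name = ?
        ORDER BY region_id, sample_name
    """
    wide = {}
    for region_id, sample_name, metric in conn.execute(query, (library,)):
        wide.setdefault(region_id, {})[sample_name] = metric
    return wide


for row in poor_region_summary(conn, 0.5):
    print(row)
print(wide_view(conn, "lib1"))
```

Because the wide view is computed per library, only the samples actually run on that library show up as columns, so the empty cells and the ever-growing column list never need to exist in storage.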
