-
Notifications
You must be signed in to change notification settings - Fork 8
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Port database loading scripts to AllianceMine #1120
Comments
Add documents to the Wiki. I wanted to take a break from it last week, so I didn't investigate PPI. I will do it this week. |
I updated the Wiki with all analyses for GRN and PPI and the proposed solutions for each. |
@ntran18’s PR to query the gene source rather than network in order to find ERT1 has been merged but appears to still not find ERT1 on the installed beta deployment. @ntran18 will investigate the issue Further @ntran18 will meet with @kdahlquist to clarify the updated queries on AllianceMine and other content in the wiki writeup |
The ERT1 issue should be fixed now. The issue was that when we updated the database, we forgot to update the database's GRN gene table. |
@kdahlquist https://www.alliancegenome.org/bluegenes/alliancemine/templates. Currently AllianceMine has some templates for Yeastmine. Could you please check over them? |
I looked at the Organism > All Genes query results from the link given in the above comment. This looks like what we need to retrieve all the genes in yeast. There are a couple of observations:
The Literature > Regulation template is what we need to use to get the network information.
One more thing to note. I looked at the downloaded results files and it looks like when a gene doesn't have a standard name, they leave that field blank. In that case, we need to copy the systematic name and put it in the standard name field. Let me know if you have more questions. |
Literature > Regulation is for the GRNs. For PPI, use Literature > Interaction and just use the "physical" interaction type. |
I will take a look at these. Thank you! |
@kdahlquist for GRN network table, does the regulators and target genes also have "ORF" feature type? |
Yes, they should. I don't see where the GRN network table actually includes an ORF field, but visual inspection of the table I can download from the web interface shows that they are all ORFs. One thing to note is that there is a "strain" field. We should only capture "S288c" data. We could explore in the future whether to make an option to filter by strain, but for now, we just want to capture the "S288c" data. |
ORFI modified the query to filter only the ORF feature type, which now returns 28,364 interactions compared to the original query, which returned 29,510 interactions. Could you please confirm which query I should be using? @kdahlquist
Database Update IssueI encountered an issue while attempting to update the protein standard name in the database. When trying to change the standard name in the protein table, an error occurs stating that the standard name is referenced in the physical_interactions table, which prevents the update. After discussing the issue with @dondi, we discovered that the schema for the
Missing
|
For the first question:
|
For the second question:
|
For the third question:
|
Question: are week keeping the "Annotation Type" field in our database for GRNs and/or PPI? If we are not, can we keep it? I can envision a feature request where users can select whether they want to look at just manually curated, just high-throughput, or both types of interactions on GRNs and PPIs that are loaded from the database. |
I looked at the schemas and have a question. I see on the PPI database schema, there is this in the
What does this mean? I think it means to exclude duplicate protein-protein interactions. However, by including both manually curated and high-throughput, there might actually be duplicates if an interaction was annotated in both ways. Also, would this same constraint be needed in the GRN database? |
My current implementation already filter out the duplicate interactions. So it doesn't care what annoted type is currently. Do you want to save duplicate interactions or is there a priority for selecting the preferred annotation type? |
We would need to save them. The user will choose from "manually curated only", "high throughput only" or "both manually curated and "high throughput". We wouldn't want to lose an interaction from either category. |
I edited the new schema for the database, but I have a few questions:
|
|
@ntran18 and @dondi discussed over the week and had the following conclusions:
|
After @ceciliazaragoza finishes work on #1150, she can join up with @ntran18 for the database work since this will be a heavier lift |
Now that InterMine is no more, we need to port our loading scripts to AllianceMine
The text was updated successfully, but these errors were encountered: