Emap2sec+ is a computational tool using deep learning that can accurately identify structures, alpha helices, beta sheets, other(coils/turns) and DNA/RNA, in cryo-Electron Microscopy (EM) maps of medium to low resolution.
(1) Process cryo-EM map (*.mrc file) and change grid size to 1.
(2) Scan EM map to get voxel input and corrsponding locations and residue IDs(if with PDB structure) and save it in *.trimmap file.
(3) (Optional) Assign Structure labels by Stride (if with PDB structure).
(4) Generate *.input file which only includes voxel information and label information(if with PDB structure).
(5) Apply Phase1 Network and Phase2 Network to assign labels for each voxels and save the predictions in *pred.txt.
(6) Output the evaluation report in *report.txt (if with PDB structure).
(7) Output *.pdb and *.pml file to visualize predictions.
python3 main.py --mode=0 -F=[Map_path]
--type=1 --gpu=0 --class=4
pymol -u *.pml
In the Predict_Result_WithPDB/SIMU10/[Input_Map_Name], our evaluation report will be saved in *_report.txt.python3 main.py --mode=1 -F=[Map_path] ---P=[PDB_Path]
type=1 --gpu=0 --class=4
Here the precision is the fraction of correct predicted structures among the specific predicted structure, while recall (also known as sensitivity) is the fraction of the total amount of the specific structure that were actually retrieved. The F1 score is the harmonic mean of the precision and recall. The support is the number of voxels with the structure label. The macro measurement means macro-averaging(taking all classes as equally important), while the micro means mirco-averaging (biased by class frequency).
python3 main.py --mode=0 -F=[Map_path] –type=3 --gpu=0
--class=4 --fold=3 -–contour=0.006
pymol -u *.pml
In the Predict_Result_WithPDB/REAL/Fold3_Model_Result/[Input_Map_Name], our evaluation report will be saved in *_report.txt. Here is an example of our evaluation report of 6BJS.python3 main.py --mode=1 -F=[Map_path] ---P=[PDB_Path]
--type=3 --gpu=0 --class=4 -–fold=3 -–contour=0.006
Here the precision is the fraction of correct predicted structures among the specific predicted structure, while recall (also known as sensitivity) is the fraction of the total amount of the specific structure that were actually retrieved. The F1 score is the harmonic mean of the precision and recall. The support is the number of voxels with the structure label. The macro measurement means macro-averaging(taking all classes as equally important), while the micro means mirco-averaging (biased by class frequency).
1. First, you need to create an account on code ocean using academic credentials and login into your account.
2. Then, click on the links above and go to the desired code ocean capsule.
3. To make a reproducible run i.e. run the code on an example input provided by us, click on the "Reproducible Run" button in the top right corner. This will start running the code on our example, and the results will be generated in the results folder at the bottom right after the execution is complete.
4. To run the code on an input of your choice, first go to our capsule and click on the "Edit Capsule" button in the top right corner. This will make a copy of our capsule which you can edit. Follow the instructions about how to upload and run your input in this copied capsule by reading the readme file present in the respective code ocean capsules.
Other details specific to the respective capsules can be found in the readme files of the capsules. For more details on how to run code ocean capsule please visit: Code Ocean user documentation
Emap2sec+ is a free software for academic and non-commercial users.
It is released under the terms of the GNU General Public License Ver.3 (https://www.gnu.org/licenses/gpl-3.0.en.html).
Commercial users please contact dkihara@purdue.edu for alternate licensing.
Citation of the following reference should be included in any publication that uses data or results generated by Emap2sec+ program.
Xiao Wang, Eman Alnabati, Tunde W Aderinwale, Sai Raghavendra Maddhuri Venkata Subramaniya, Genki Terashi, & Daisuke Kihara. Detecting Protein and DNA/RNA Structures in Cryo-EM Maps of Intermediate Resolution Using Deep Learning. Nature Communications 12, 2302 (2021)
The simulated EM map dataset used in this paper is available at SIMU_MAPS
© 2024 KIHARA Bioinformatics LABORATORY, PURDUE University | Design by TEMPLATED.