request for an example command line

I am having trouble understanding the manual. I wish to show the side chain for a particular amino acid in my protein. What is the command line to do this? thank you James Raymond

Hi James, There are lots of ways to do it, but here is the command to show all atoms of residue 10 in chain A of model #1: show #1/A:10 You can be less specific, e.g.: show /A:10 ...meaning display residue 10 of chain A in all models (and maybe you only have one anyway) show :10 ...meaning display residue 10 in all chains and all models that have a residue 10 All of the command help pages include a usage line showing how to use the command, where "atom-spec" in the usage line links to this other page that explains all of the stuff I said above, and more (e.g. you could also use residue name, atomic element symbol, ...), including several examples: <https://rbvi.ucsf.edu/chimerax/docs/user/commands/atomspec.html> You can also click the link from the word "Usage" which explains the conventions in the command usage information. Once you understand the usage of the command, it is pretty simple, as all the options will be explained in the help. There are also many example commands in the tutorials <https://www.rbvi.ucsf.edu/chimerax/tutorials.html> ... for example, protein-ligand binding sites <https://www.rbvi.ucsf.edu/chimerax/docs/user/tutorials/binding-sites.html> ... delta-opioid receptor ligand bindin <https://www.rbvi.ucsf.edu/chimerax/data/delta-opioid-jan2020/binding.html> It is hard to answer a basic question like this completely because there are so many other ways to do it, however. For example, you could show the sequence of the protein and select the residue from there, or Ctrl-click on its ribbon segment in the graphics window to select it, and then use the Actions menu to show the atoms, or instead of the Actions menu, you could use the icon for atom display in the toolbar across the top. I hope this helps, Elaine ----- Elaine C. Meng, Ph.D. UCSF Chimera(X) team Department of Pharmaceutical Chemistry University of California, San Francisco
On May 8, 2023, at 1:02 PM, James Raymond via ChimeraX-users <chimerax-users@cgl.ucsf.edu> wrote:
I am having trouble understanding the manual. I wish to show the side chain for a particular amino acid in my protein. What is the command line to do this?
thank you James Raymond

Hi Elaine, thank you so much. it worked. I got bogged down in instructions that made me think I had to use "sidechain" or "side chain" in the command, and all I got was errors. JIm [image: image.png] On Mon, May 8, 2023 at 1:29 PM Elaine Meng <meng@cgl.ucsf.edu> wrote:
Hi James, There are lots of ways to do it, but here is the command to show all atoms of residue 10 in chain A of model #1:
show #1/A:10
You can be less specific, e.g.:
show /A:10
...meaning display residue 10 of chain A in all models (and maybe you only have one anyway)
show :10
...meaning display residue 10 in all chains and all models that have a residue 10
All of the command help pages include a usage line showing how to use the command, where "atom-spec" in the usage line links to this other page that explains all of the stuff I said above, and more (e.g. you could also use residue name, atomic element symbol, ...), including several examples: <https://rbvi.ucsf.edu/chimerax/docs/user/commands/atomspec.html>
You can also click the link from the word "Usage" which explains the conventions in the command usage information. Once you understand the usage of the command, it is pretty simple, as all the options will be explained in the help.
There are also many example commands in the tutorials <https://www.rbvi.ucsf.edu/chimerax/tutorials.html>
... for example, protein-ligand binding sites <https://www.rbvi.ucsf.edu/chimerax/docs/user/tutorials/binding-sites.html
... delta-opioid receptor ligand bindin <https://www.rbvi.ucsf.edu/chimerax/data/delta-opioid-jan2020/binding.html
It is hard to answer a basic question like this completely because there are so many other ways to do it, however. For example, you could show the sequence of the protein and select the residue from there, or Ctrl-click on its ribbon segment in the graphics window to select it, and then use the Actions menu to show the atoms, or instead of the Actions menu, you could use the icon for atom display in the toolbar across the top.
I hope this helps, Elaine ----- Elaine C. Meng, Ph.D. UCSF Chimera(X) team Department of Pharmaceutical Chemistry University of California, San Francisco
On May 8, 2023, at 1:02 PM, James Raymond via ChimeraX-users < chimerax-users@cgl.ucsf.edu> wrote:
I am having trouble understanding the manual. I wish to show the side chain for a particular amino acid in my protein. What is the command line to do this?
thank you James Raymond

Hi James, You're welcome! I was guessing that you were already showing ribbons, which (by default) prevents displaying backbone atoms at the same time, so that the commands I gave would show the sidechain in addition to the existing ribbon representation of the backbone. If you weren't showing the ribbon, the example commands I gave would show all of the atoms of the whole residue. If the ribbon wasn't shown and you really wanted to show only the side chain atoms of the residue and NOT the backbone atoms, you would use the option to show side chain only. That option appears in the Actions menu as in the image you just sent, or you can use the command-line specifier "sidechain" (includes CA atom) or "sideonly" (does not include CA atom) explained in the atomspec page: <https://rbvi.ucsf.edu/chimerax/docs/user/commands/atomspec.html#builtin> ...for example intersecting the "sidechain" specification with the chain and residue number specification: show /A:10 & sidechain I hope this helps, Elaine ----- Elaine C. Meng, Ph.D. UCSF Chimera(X) team Department of Pharmaceutical Chemistry University of California, San Francisco
On May 8, 2023, at 2:03 PM, James Raymond via ChimeraX-users <chimerax-users@cgl.ucsf.edu> wrote:
Hi Elaine, thank you so much. it worked. I got bogged down in instructions that made me think I had to use "sidechain" or "side chain" in the command, and all I got was errors. JIm <image.png>
On Mon, May 8, 2023 at 1:29 PM Elaine Meng <meng@cgl.ucsf.edu> wrote: Hi James, There are lots of ways to do it, but here is the command to show all atoms of residue 10 in chain A of model #1:
show #1/A:10
You can be less specific, e.g.:
show /A:10
...meaning display residue 10 of chain A in all models (and maybe you only have one anyway)
show :10
...meaning display residue 10 in all chains and all models that have a residue 10
All of the command help pages include a usage line showing how to use the command, where "atom-spec" in the usage line links to this other page that explains all of the stuff I said above, and more (e.g. you could also use residue name, atomic element symbol, ...), including several examples: <https://rbvi.ucsf.edu/chimerax/docs/user/commands/atomspec.html>
You can also click the link from the word "Usage" which explains the conventions in the command usage information. Once you understand the usage of the command, it is pretty simple, as all the options will be explained in the help.
There are also many example commands in the tutorials <https://www.rbvi.ucsf.edu/chimerax/tutorials.html>
... for example, protein-ligand binding sites <https://www.rbvi.ucsf.edu/chimerax/docs/user/tutorials/binding-sites.html> ... delta-opioid receptor ligand bindin <https://www.rbvi.ucsf.edu/chimerax/data/delta-opioid-jan2020/binding.html>
It is hard to answer a basic question like this completely because there are so many other ways to do it, however. For example, you could show the sequence of the protein and select the residue from there, or Ctrl-click on its ribbon segment in the graphics window to select it, and then use the Actions menu to show the atoms, or instead of the Actions menu, you could use the icon for atom display in the toolbar across the top.
I hope this helps, Elaine ----- Elaine C. Meng, Ph.D. UCSF Chimera(X) team Department of Pharmaceutical Chemistry University of California, San Francisco
On May 8, 2023, at 1:02 PM, James Raymond via ChimeraX-users <chimerax-users@cgl.ucsf.edu> wrote:
I am having trouble understanding the manual. I wish to show the side chain for a particular amino acid in my protein. What is the command line to do this?
thank you James Raymond
_______________________________________________ ChimeraX-users mailing list ChimeraX-users@cgl.ucsf.edu Manage subscription: https://www.rbvi.ucsf.edu/mailman/listinfo/chimerax-users

thank you. I used your commands to show these 3 Thr sidechains Adding "& sidechain" didn't seem to make a difference. [image: image.png] On Mon, May 8, 2023 at 2:38 PM Elaine Meng <meng@cgl.ucsf.edu> wrote:
Hi James, You're welcome!
I was guessing that you were already showing ribbons, which (by default) prevents displaying backbone atoms at the same time, so that the commands I gave would show the sidechain in addition to the existing ribbon representation of the backbone. If you weren't showing the ribbon, the example commands I gave would show all of the atoms of the whole residue.
If the ribbon wasn't shown and you really wanted to show only the side chain atoms of the residue and NOT the backbone atoms, you would use the option to show side chain only.
That option appears in the Actions menu as in the image you just sent, or you can use the command-line specifier "sidechain" (includes CA atom) or "sideonly" (does not include CA atom) explained in the atomspec page: <https://rbvi.ucsf.edu/chimerax/docs/user/commands/atomspec.html#builtin>
...for example intersecting the "sidechain" specification with the chain and residue number specification:
show /A:10 & sidechain
I hope this helps, Elaine ----- Elaine C. Meng, Ph.D. UCSF Chimera(X) team Department of Pharmaceutical Chemistry University of California, San Francisco
On May 8, 2023, at 2:03 PM, James Raymond via ChimeraX-users < chimerax-users@cgl.ucsf.edu> wrote:
Hi Elaine, thank you so much. it worked. I got bogged down in instructions that made me think I had to use "sidechain" or "side chain" in the command, and all I got was errors. JIm <image.png>
On Mon, May 8, 2023 at 1:29 PM Elaine Meng <meng@cgl.ucsf.edu> wrote: Hi James, There are lots of ways to do it, but here is the command to show all atoms of residue 10 in chain A of model #1:
show #1/A:10
You can be less specific, e.g.:
show /A:10
...meaning display residue 10 of chain A in all models (and maybe you only have one anyway)
show :10
...meaning display residue 10 in all chains and all models that have a residue 10
All of the command help pages include a usage line showing how to use the command, where "atom-spec" in the usage line links to this other page that explains all of the stuff I said above, and more (e.g. you could also use residue name, atomic element symbol, ...), including several examples: <https://rbvi.ucsf.edu/chimerax/docs/user/commands/atomspec.html>
You can also click the link from the word "Usage" which explains the conventions in the command usage information. Once you understand the usage of the command, it is pretty simple, as all the options will be explained in the help.
There are also many example commands in the tutorials <https://www.rbvi.ucsf.edu/chimerax/tutorials.html>
... for example, protein-ligand binding sites < https://www.rbvi.ucsf.edu/chimerax/docs/user/tutorials/binding-sites.html> ... delta-opioid receptor ligand bindin < https://www.rbvi.ucsf.edu/chimerax/data/delta-opioid-jan2020/binding.html>
It is hard to answer a basic question like this completely because there are so many other ways to do it, however. For example, you could show the sequence of the protein and select the residue from there, or Ctrl-click on its ribbon segment in the graphics window to select it, and then use the Actions menu to show the atoms, or instead of the Actions menu, you could use the icon for atom display in the toolbar across the top.
I hope this helps, Elaine ----- Elaine C. Meng, Ph.D. UCSF Chimera(X) team Department of Pharmaceutical Chemistry University of California, San Francisco
On May 8, 2023, at 1:02 PM, James Raymond via ChimeraX-users < chimerax-users@cgl.ucsf.edu> wrote:
I am having trouble understanding the manual. I wish to show the side chain for a particular amino acid in my protein. What is the command line to do this?
thank you James Raymond
_______________________________________________ ChimeraX-users mailing list ChimeraX-users@cgl.ucsf.edu Manage subscription: https://www.rbvi.ucsf.edu/mailman/listinfo/chimerax-users

Yes, as I tried to explain before, the ribbon suppresses the backbone atom display, so it doesn't make a difference when the ribbon is shown. If you hide the ribbon, it will reveal whether the backbone atoms are also shown or not. Best, Elaine
On May 8, 2023, at 3:49 PM, James Raymond via ChimeraX-users <chimerax-users@cgl.ucsf.edu> wrote:
thank you. I used your commands to show these 3 Thr sidechains Adding "& sidechain" didn't seem to make a difference.
<image.png>
On Mon, May 8, 2023 at 2:38 PM Elaine Meng <meng@cgl.ucsf.edu> wrote: Hi James, You're welcome!
I was guessing that you were already showing ribbons, which (by default) prevents displaying backbone atoms at the same time, so that the commands I gave would show the sidechain in addition to the existing ribbon representation of the backbone. If you weren't showing the ribbon, the example commands I gave would show all of the atoms of the whole residue.
If the ribbon wasn't shown and you really wanted to show only the side chain atoms of the residue and NOT the backbone atoms, you would use the option to show side chain only.
That option appears in the Actions menu as in the image you just sent, or you can use the command-line specifier "sidechain" (includes CA atom) or "sideonly" (does not include CA atom) explained in the atomspec page: <https://rbvi.ucsf.edu/chimerax/docs/user/commands/atomspec.html#builtin>
...for example intersecting the "sidechain" specification with the chain and residue number specification:
show /A:10 & sidechain
I hope this helps, Elaine ----- Elaine C. Meng, Ph.D. UCSF Chimera(X) team Department of Pharmaceutical Chemistry University of California, San Francisco
On May 8, 2023, at 2:03 PM, James Raymond via ChimeraX-users <chimerax-users@cgl.ucsf.edu> wrote:
Hi Elaine, thank you so much. it worked. I got bogged down in instructions that made me think I had to use "sidechain" or "side chain" in the command, and all I got was errors. JIm <image.png>
On Mon, May 8, 2023 at 1:29 PM Elaine Meng <meng@cgl.ucsf.edu> wrote: Hi James, There are lots of ways to do it, but here is the command to show all atoms of residue 10 in chain A of model #1:
show #1/A:10
You can be less specific, e.g.:
show /A:10
...meaning display residue 10 of chain A in all models (and maybe you only have one anyway)
show :10
...meaning display residue 10 in all chains and all models that have a residue 10
All of the command help pages include a usage line showing how to use the command, where "atom-spec" in the usage line links to this other page that explains all of the stuff I said above, and more (e.g. you could also use residue name, atomic element symbol, ...), including several examples: <https://rbvi.ucsf.edu/chimerax/docs/user/commands/atomspec.html>
You can also click the link from the word "Usage" which explains the conventions in the command usage information. Once you understand the usage of the command, it is pretty simple, as all the options will be explained in the help.
There are also many example commands in the tutorials <https://www.rbvi.ucsf.edu/chimerax/tutorials.html>
... for example, protein-ligand binding sites <https://www.rbvi.ucsf.edu/chimerax/docs/user/tutorials/binding-sites.html> ... delta-opioid receptor ligand bindin <https://www.rbvi.ucsf.edu/chimerax/data/delta-opioid-jan2020/binding.html>
It is hard to answer a basic question like this completely because there are so many other ways to do it, however. For example, you could show the sequence of the protein and select the residue from there, or Ctrl-click on its ribbon segment in the graphics window to select it, and then use the Actions menu to show the atoms, or instead of the Actions menu, you could use the icon for atom display in the toolbar across the top.
I hope this helps, Elaine ----- Elaine C. Meng, Ph.D. UCSF Chimera(X) team Department of Pharmaceutical Chemistry University of California, San Francisco
On May 8, 2023, at 1:02 PM, James Raymond via ChimeraX-users <chimerax-users@cgl.ucsf.edu> wrote:
I am having trouble understanding the manual. I wish to show the side chain for a particular amino acid in my protein. What is the command line to do this?
thank you James Raymond
_______________________________________________ ChimeraX-users mailing list ChimeraX-users@cgl.ucsf.edu Manage subscription: https://www.rbvi.ucsf.edu/mailman/listinfo/chimerax-users
_______________________________________________ ChimeraX-users mailing list ChimeraX-users@cgl.ucsf.edu Manage subscription: https://www.rbvi.ucsf.edu/mailman/listinfo/chimerax-users

I see. thanks again. On Mon, May 8, 2023 at 3:55 PM Elaine Meng <meng@cgl.ucsf.edu> wrote:
Yes, as I tried to explain before, the ribbon suppresses the backbone atom display, so it doesn't make a difference when the ribbon is shown. If you hide the ribbon, it will reveal whether the backbone atoms are also shown or not. Best, Elaine
On May 8, 2023, at 3:49 PM, James Raymond via ChimeraX-users < chimerax-users@cgl.ucsf.edu> wrote:
thank you. I used your commands to show these 3 Thr sidechains Adding "& sidechain" didn't seem to make a difference.
<image.png>
On Mon, May 8, 2023 at 2:38 PM Elaine Meng <meng@cgl.ucsf.edu> wrote: Hi James, You're welcome!
I was guessing that you were already showing ribbons, which (by default) prevents displaying backbone atoms at the same time, so that the commands I gave would show the sidechain in addition to the existing ribbon representation of the backbone. If you weren't showing the ribbon, the example commands I gave would show all of the atoms of the whole residue.
If the ribbon wasn't shown and you really wanted to show only the side chain atoms of the residue and NOT the backbone atoms, you would use the option to show side chain only.
That option appears in the Actions menu as in the image you just sent, or you can use the command-line specifier "sidechain" (includes CA atom) or "sideonly" (does not include CA atom) explained in the atomspec page: <https://rbvi.ucsf.edu/chimerax/docs/user/commands/atomspec.html#builtin
...for example intersecting the "sidechain" specification with the chain and residue number specification:
show /A:10 & sidechain
I hope this helps, Elaine ----- Elaine C. Meng, Ph.D. UCSF Chimera(X) team Department of Pharmaceutical Chemistry University of California, San Francisco
On May 8, 2023, at 2:03 PM, James Raymond via ChimeraX-users < chimerax-users@cgl.ucsf.edu> wrote:
Hi Elaine, thank you so much. it worked. I got bogged down in instructions that made me think I had to use "sidechain" or "side chain" in the command, and all I got was errors. JIm <image.png>
On Mon, May 8, 2023 at 1:29 PM Elaine Meng <meng@cgl.ucsf.edu> wrote: Hi James, There are lots of ways to do it, but here is the command to show all atoms of residue 10 in chain A of model #1:
show #1/A:10
You can be less specific, e.g.:
show /A:10
...meaning display residue 10 of chain A in all models (and maybe you only have one anyway)
show :10
...meaning display residue 10 in all chains and all models that have a residue 10
All of the command help pages include a usage line showing how to use the command, where "atom-spec" in the usage line links to this other page that explains all of the stuff I said above, and more (e.g. you could also use residue name, atomic element symbol, ...), including several examples: <https://rbvi.ucsf.edu/chimerax/docs/user/commands/atomspec.html>
You can also click the link from the word "Usage" which explains the conventions in the command usage information. Once you understand the usage of the command, it is pretty simple, as all the options will be explained in the help.
There are also many example commands in the tutorials <https://www.rbvi.ucsf.edu/chimerax/tutorials.html>
... for example, protein-ligand binding sites < https://www.rbvi.ucsf.edu/chimerax/docs/user/tutorials/binding-sites.html> ... delta-opioid receptor ligand bindin < https://www.rbvi.ucsf.edu/chimerax/data/delta-opioid-jan2020/binding.html>
It is hard to answer a basic question like this completely because there are so many other ways to do it, however. For example, you could show the sequence of the protein and select the residue from there, or Ctrl-click on its ribbon segment in the graphics window to select it, and then use the Actions menu to show the atoms, or instead of the Actions menu, you could use the icon for atom display in the toolbar across the top.
I hope this helps, Elaine ----- Elaine C. Meng, Ph.D. UCSF Chimera(X) team Department of Pharmaceutical Chemistry University of California, San Francisco
On May 8, 2023, at 1:02 PM, James Raymond via ChimeraX-users < chimerax-users@cgl.ucsf.edu> wrote:
I am having trouble understanding the manual. I wish to show the side chain for a particular amino acid in my protein. What is the command line to do this?
thank you James Raymond
_______________________________________________ ChimeraX-users mailing list ChimeraX-users@cgl.ucsf.edu Manage subscription: https://www.rbvi.ucsf.edu/mailman/listinfo/chimerax-users
_______________________________________________ ChimeraX-users mailing list ChimeraX-users@cgl.ucsf.edu Manage subscription: https://www.rbvi.ucsf.edu/mailman/listinfo/chimerax-users

Hi Elaine, I am writing up some results. Would you please see if I am describing our method correctly? Our protein has 2 domains, one whose structure is well known and for which many PDBs are available, and another that is novel. The PDB IDs below came from your file entitled af512_template_domain_names.json "Protein structure was predicted with AlphaFold2 (Jumper* et al.*, 2021) as implemented by UCSF ChimeraX v. 1.6rc (Pettersen* et al.*, 2021) with the option to use PDB templates. In practice, this meant that the N-terminal DUF3494 domain of the protein was predicted based on several existing PDB structures (7bwy_A, 7bwx_A, 3uyu_A, 6a8k_A, 3wp9_A, 4nu3_A, 4nu3_B, 6bg8_B, 4nu2_A, 3vn3_A, 6eio_A, 4nuh_A, 7dc5_B, 5uyt_A, 5uyt_B, 5uyt_A, 5uyt_B) and the C-terminal domain was predicted de novo by AlphaFold." Is it OK? thank you Jim

Hi Jim, We (ChimeraX developers) are not the AlphaFold developers and I personally haven't looked at the details of any of its output files, nor am I an expert on your particular protein of interest. So although your inference sounds reasonable, I can't say anything authoritative about it. ....except that you should say that the calculation is actually done with ColabFold, as mentioned in the documentation: <https://rbvi.ucsf.edu/chimerax/docs/user/tools/alphafold.html> Maybe a careful reading of the AlphaFold and ColabFold papers would help to confirm the rest of your description. Best, Elaine ----- Elaine C. Meng, Ph.D. UCSF Chimera(X) team Department of Pharmaceutical Chemistry University of California, San Francisco
On May 9, 2023, at 1:38 PM, James Raymond via ChimeraX-users <chimerax-users@cgl.ucsf.edu> wrote:
Hi Elaine, I am writing up some results. Would you please see if I am describing our method correctly? Our protein has 2 domains, one whose structure is well known and for which many PDBs are available, and another that is novel. The PDB IDs below came from your file entitled af512_template_domain_names.json
"Protein structure was predicted with AlphaFold2 (Jumper et al., 2021) as implemented by UCSF ChimeraX v. 1.6rc (Pettersen et al., 2021) with the option to use PDB templates. In practice, this meant that the N-terminal DUF3494 domain of the protein was predicted based on several existing PDB structures (7bwy_A, 7bwx_A, 3uyu_A, 6a8k_A, 3wp9_A, 4nu3_A, 4nu3_B, 6bg8_B, 4nu2_A, 3vn3_A, 6eio_A, 4nuh_A, 7dc5_B, 5uyt_A, 5uyt_B, 5uyt_A, 5uyt_B) and the C-terminal domain was predicted de novo by AlphaFold."
Is it OK?
thank you Jim

Hi Jim, As Elaine says, ChimeraX is using ColabFold, which is an optimized version of AlphaFold. So you should cite ColabFold (and AlphaFold and ChimeraX). Also your description of the template use is a bit misleading. First of all, AlphaFold only uses at most 4 PDB templates per chain. The templates you saw in the JSON file came from ColabFold. It is confusing, because AlphaFold will find up to 20 templates per chain, but then it only uses the 4 best scoring. I think that is always the first 4 of the 20. These details are from reading the AlphaFold code on GitHub. Unfortunately the actual workings of AlphaFold are poorly described in the AlphaFold publications. For instance, you will also see in the code that any template with more than 95% sequence identity is not used. That seems wrong, since you would of course want to use the best sequence matches, but I think it is a hold-over from the code that trained AlphaFold where they didn't want exact template matches. This is typical of AlphaFold code, it is shoddy work where DeepMind seemed mostly interested in showing off machine learning, with little concern for actually producing the best biology results. One last comment about the AlphaFold use of templates. In almost all cases AlphaFold more or less ignores the templates. In other words, it produces nearly the same structure whether it uses templates or not. From hundreds of runs on different sequences my experience has been that if AlphaFold gives a high confidence prediction (say mostly plddt > 80), then even if it uses an experimental structure as a template that differs, it will always prefer its predicted conformation over the experimental structure template. So only in cases where AlphaFold is not confident of its prediction will templates possibly help. Basically the template just is a starting structure, and if AlphaFold is at all confident in another conformation it converges to that other conformation and that template makes no difference. Tom ChimeraX AlphaFold developer
On May 9, 2023, at 2:06 PM, Elaine Meng via ChimeraX-users <chimerax-users@cgl.ucsf.edu> wrote:
Hi Jim, We (ChimeraX developers) are not the AlphaFold developers and I personally haven't looked at the details of any of its output files, nor am I an expert on your particular protein of interest. So although your inference sounds reasonable, I can't say anything authoritative about it.
....except that you should say that the calculation is actually done with ColabFold, as mentioned in the documentation: <https://rbvi.ucsf.edu/chimerax/docs/user/tools/alphafold.html>
Maybe a careful reading of the AlphaFold and ColabFold papers would help to confirm the rest of your description.
Best, Elaine ----- Elaine C. Meng, Ph.D. UCSF Chimera(X) team Department of Pharmaceutical Chemistry University of California, San Francisco
On May 9, 2023, at 1:38 PM, James Raymond via ChimeraX-users <chimerax-users@cgl.ucsf.edu> wrote:
Hi Elaine, I am writing up some results. Would you please see if I am describing our method correctly? Our protein has 2 domains, one whose structure is well known and for which many PDBs are available, and another that is novel. The PDB IDs below came from your file entitled af512_template_domain_names.json
"Protein structure was predicted with AlphaFold2 (Jumper et al., 2021) as implemented by UCSF ChimeraX v. 1.6rc (Pettersen et al., 2021) with the option to use PDB templates. In practice, this meant that the N-terminal DUF3494 domain of the protein was predicted based on several existing PDB structures (7bwy_A, 7bwx_A, 3uyu_A, 6a8k_A, 3wp9_A, 4nu3_A, 4nu3_B, 6bg8_B, 4nu2_A, 3vn3_A, 6eio_A, 4nuh_A, 7dc5_B, 5uyt_A, 5uyt_B, 5uyt_A, 5uyt_B) and the C-terminal domain was predicted de novo by AlphaFold."
Is it OK?
thank you Jim
_______________________________________________ ChimeraX-users mailing list ChimeraX-users@cgl.ucsf.edu Manage subscription: https://www.rbvi.ucsf.edu/mailman/listinfo/chimerax-users

Hi Tom, Very interesting. thank you. I will try it again without checking the PDB option to see if it makes a difference. Jim On Tue, May 9, 2023 at 2:25 PM Tom Goddard <goddard@sonic.net> wrote:
Hi Jim,
As Elaine says, ChimeraX is using ColabFold, which is an optimized version of AlphaFold. So you should cite ColabFold (and AlphaFold and ChimeraX).
Also your description of the template use is a bit misleading. First of all, AlphaFold only uses at most 4 PDB templates per chain. The templates you saw in the JSON file came from ColabFold. It is confusing, because AlphaFold will find up to 20 templates per chain, but then it only uses the 4 best scoring. I think that is always the first 4 of the 20. These details are from reading the AlphaFold code on GitHub. Unfortunately the actual workings of AlphaFold are poorly described in the AlphaFold publications. For instance, you will also see in the code that any template with more than 95% sequence identity is not used. That seems wrong, since you would of course want to use the best sequence matches, but I think it is a hold-over from the code that trained AlphaFold where they didn't want exact template matches. This is typical of AlphaFold code, it is shoddy work where DeepMind seemed mostly interested in showing off machine learning, with little concern for actually producing the best biology results. One last comment about the AlphaFold use of templates. In almost all cases AlphaFold more or less ignores the templates. In other words, it produces nearly the same structure whether it uses templates or not. From hundreds of runs on different sequences my experience has been that if AlphaFold gives a high confidence prediction (say mostly plddt > 80), then even if it uses an experimental structure as a template that differs, it will always prefer its predicted conformation over the experimental structure template. So only in cases where AlphaFold is not confident of its prediction will templates possibly help. Basically the template just is a starting structure, and if AlphaFold is at all confident in another conformation it converges to that other conformation and that template makes no difference.
Tom ChimeraX AlphaFold developer
On May 9, 2023, at 2:06 PM, Elaine Meng via ChimeraX-users < chimerax-users@cgl.ucsf.edu> wrote:
Hi Jim, We (ChimeraX developers) are not the AlphaFold developers and I personally haven't looked at the details of any of its output files, nor am I an expert on your particular protein of interest. So although your inference sounds reasonable, I can't say anything authoritative about it.
....except that you should say that the calculation is actually done with ColabFold, as mentioned in the documentation: <https://rbvi.ucsf.edu/chimerax/docs/user/tools/alphafold.html>
Maybe a careful reading of the AlphaFold and ColabFold papers would help to confirm the rest of your description.
Best, Elaine ----- Elaine C. Meng, Ph.D. UCSF Chimera(X) team Department of Pharmaceutical Chemistry University of California, San Francisco
On May 9, 2023, at 1:38 PM, James Raymond via ChimeraX-users < chimerax-users@cgl.ucsf.edu> wrote:
Hi Elaine, I am writing up some results. Would you please see if I am describing our method correctly? Our protein has 2 domains, one whose structure is well known and for which many PDBs are available, and another that is novel. The PDB IDs below came from your file entitled af512_template_domain_names.json
"Protein structure was predicted with AlphaFold2 (Jumper et al., 2021) as implemented by UCSF ChimeraX v. 1.6rc (Pettersen et al., 2021) with the option to use PDB templates. In practice, this meant that the N-terminal DUF3494 domain of the protein was predicted based on several existing PDB structures (7bwy_A, 7bwx_A, 3uyu_A, 6a8k_A, 3wp9_A, 4nu3_A, 4nu3_B, 6bg8_B, 4nu2_A, 3vn3_A, 6eio_A, 4nuh_A, 7dc5_B, 5uyt_A, 5uyt_B, 5uyt_A, 5uyt_B) and the C-terminal domain was predicted de novo by AlphaFold."
Is it OK?
thank you Jim
_______________________________________________ ChimeraX-users mailing list ChimeraX-users@cgl.ucsf.edu Manage subscription: https://www.rbvi.ucsf.edu/mailman/listinfo/chimerax-users

The predicted structure for the well-known domain was essentially the same when the PDB option was not selected. The output also didn't have the template_domain_names.json file, so it didn't appear to use existing PDB structures. Is it possible that it did use existing PDB structures without recording it in the results? , If it didn't, it's amazing that it got the same results without using the x-ray diffraction results. One difference was that not checking the PDB option caused the 2nd domain of the protein to rotate 90 degrees with respect to the first domain. Thank you for your help. Jim On Tue, May 9, 2023 at 2:47 PM James Raymond <raymond@unlv.nevada.edu> wrote:
Hi Tom, Very interesting. thank you. I will try it again without checking the PDB option to see if it makes a difference. Jim
On Tue, May 9, 2023 at 2:25 PM Tom Goddard <goddard@sonic.net> wrote:
Hi Jim,
As Elaine says, ChimeraX is using ColabFold, which is an optimized version of AlphaFold. So you should cite ColabFold (and AlphaFold and ChimeraX).
Also your description of the template use is a bit misleading. First of all, AlphaFold only uses at most 4 PDB templates per chain. The templates you saw in the JSON file came from ColabFold. It is confusing, because AlphaFold will find up to 20 templates per chain, but then it only uses the 4 best scoring. I think that is always the first 4 of the 20. These details are from reading the AlphaFold code on GitHub. Unfortunately the actual workings of AlphaFold are poorly described in the AlphaFold publications. For instance, you will also see in the code that any template with more than 95% sequence identity is not used. That seems wrong, since you would of course want to use the best sequence matches, but I think it is a hold-over from the code that trained AlphaFold where they didn't want exact template matches. This is typical of AlphaFold code, it is shoddy work where DeepMind seemed mostly interested in showing off machine learning, with little concern for actually producing the best biology results. One last comment about the AlphaFold use of templates. In almost all cases AlphaFold more or less ignores the templates. In other words, it produces nearly the same structure whether it uses templates or not. From hundreds of runs on different sequences my experience has been that if AlphaFold gives a high confidence prediction (say mostly plddt > 80), then even if it uses an experimental structure as a template that differs, it will always prefer its predicted conformation over the experimental structure template. So only in cases where AlphaFold is not confident of its prediction will templates possibly help. Basically the template just is a starting structure, and if AlphaFold is at all confident in another conformation it converges to that other conformation and that template makes no difference.
Tom ChimeraX AlphaFold developer
On May 9, 2023, at 2:06 PM, Elaine Meng via ChimeraX-users < chimerax-users@cgl.ucsf.edu> wrote:
Hi Jim, We (ChimeraX developers) are not the AlphaFold developers and I personally haven't looked at the details of any of its output files, nor am I an expert on your particular protein of interest. So although your inference sounds reasonable, I can't say anything authoritative about it.
....except that you should say that the calculation is actually done with ColabFold, as mentioned in the documentation: <https://rbvi.ucsf.edu/chimerax/docs/user/tools/alphafold.html>
Maybe a careful reading of the AlphaFold and ColabFold papers would help to confirm the rest of your description.
Best, Elaine ----- Elaine C. Meng, Ph.D. UCSF Chimera(X) team Department of Pharmaceutical Chemistry University of California, San Francisco
On May 9, 2023, at 1:38 PM, James Raymond via ChimeraX-users < chimerax-users@cgl.ucsf.edu> wrote:
Hi Elaine, I am writing up some results. Would you please see if I am describing our method correctly? Our protein has 2 domains, one whose structure is well known and for which many PDBs are available, and another that is novel. The PDB IDs below came from your file entitled af512_template_domain_names.json
"Protein structure was predicted with AlphaFold2 (Jumper et al., 2021) as implemented by UCSF ChimeraX v. 1.6rc (Pettersen et al., 2021) with the option to use PDB templates. In practice, this meant that the N-terminal DUF3494 domain of the protein was predicted based on several existing PDB structures (7bwy_A, 7bwx_A, 3uyu_A, 6a8k_A, 3wp9_A, 4nu3_A, 4nu3_B, 6bg8_B, 4nu2_A, 3vn3_A, 6eio_A, 4nuh_A, 7dc5_B, 5uyt_A, 5uyt_B, 5uyt_A, 5uyt_B) and the C-terminal domain was predicted de novo by AlphaFold."
Is it OK?
thank you Jim
_______________________________________________ ChimeraX-users mailing list ChimeraX-users@cgl.ucsf.edu Manage subscription: https://www.rbvi.ucsf.edu/mailman/listinfo/chimerax-users

Hi Jim, You can trust that ColabFold did not use any PDB templates. The fact that it did not output the template_domian_names.json file confirms that. It is not at all surprising that it got the the prediction right without templates -- if it has hundreds or thousands of sequences aligned and there are structures in the PDB that the neural net was trained on with similar folds then it doesn't need any templates. It is very surprising that your second domain position rotated 90 degrees. Very likely the AlphaFold prediction is not at all confident of the original packing or this new one. You should look at the PAE confidence estimates from AlphaFold to see if AlphaFold thinks the packing has any chance of being correct. https://www.rbvi.ucsf.edu/chimerax/data/pae-apr2022/pae.html Tom
On May 9, 2023, at 4:08 PM, James Raymond <raymond@unlv.nevada.edu> wrote:
The predicted structure for the well-known domain was essentially the same when the PDB option was not selected. The output also didn't have the template_domain_names.json file, so it didn't appear to use existing PDB structures. Is it possible that it did use existing PDB structures without recording it in the results? , If it didn't, it's amazing that it got the same results without using the x-ray diffraction results.
One difference was that not checking the PDB option caused the 2nd domain of the protein to rotate 90 degrees with respect to the first domain. Thank you for your help. Jim
On Tue, May 9, 2023 at 2:47 PM James Raymond <raymond@unlv.nevada.edu <mailto:raymond@unlv.nevada.edu>> wrote:
Hi Tom, Very interesting. thank you. I will try it again without checking the PDB option to see if it makes a difference. Jim
On Tue, May 9, 2023 at 2:25 PM Tom Goddard <goddard@sonic.net <mailto:goddard@sonic.net>> wrote:
Hi Jim,
As Elaine says, ChimeraX is using ColabFold, which is an optimized version of AlphaFold. So you should cite ColabFold (and AlphaFold and ChimeraX).
Also your description of the template use is a bit misleading. First of all, AlphaFold only uses at most 4 PDB templates per chain. The templates you saw in the JSON file came from ColabFold. It is confusing, because AlphaFold will find up to 20 templates per chain, but then it only uses the 4 best scoring. I think that is always the first 4 of the 20. These details are from reading the AlphaFold code on GitHub. Unfortunately the actual workings of AlphaFold are poorly described in the AlphaFold publications. For instance, you will also see in the code that any template with more than 95% sequence identity is not used. That seems wrong, since you would of course want to use the best sequence matches, but I think it is a hold-over from the code that trained AlphaFold where they didn't want exact template matches. This is typical of AlphaFold code, it is shoddy work where DeepMind seemed mostly interested in showing off machine learning, with little concern for actually producing the best biology results. One last comment about the AlphaFold use of templates. In almost all cases AlphaFold more or less ignores the templates. In other words, it produces nearly the same structure whether it uses templates or not. From hundreds of runs on different sequences my experience has been that if AlphaFold gives a high confidence prediction (say mostly plddt > 80), then even if it uses an experimental structure as a template that differs, it will always prefer its predicted conformation over the experimental structure template. So only in cases where AlphaFold is not confident of its prediction will templates possibly help. Basically the template just is a starting structure, and if AlphaFold is at all confident in another conformation it converges to that other conformation and that template makes no difference.
Tom ChimeraX AlphaFold developer
On May 9, 2023, at 2:06 PM, Elaine Meng via ChimeraX-users <chimerax-users@cgl.ucsf.edu <mailto:chimerax-users@cgl.ucsf.edu>> wrote:
Hi Jim, We (ChimeraX developers) are not the AlphaFold developers and I personally haven't looked at the details of any of its output files, nor am I an expert on your particular protein of interest. So although your inference sounds reasonable, I can't say anything authoritative about it.
....except that you should say that the calculation is actually done with ColabFold, as mentioned in the documentation: <https://rbvi.ucsf.edu/chimerax/docs/user/tools/alphafold.html>
Maybe a careful reading of the AlphaFold and ColabFold papers would help to confirm the rest of your description.
Best, Elaine ----- Elaine C. Meng, Ph.D. UCSF Chimera(X) team Department of Pharmaceutical Chemistry University of California, San Francisco
On May 9, 2023, at 1:38 PM, James Raymond via ChimeraX-users <chimerax-users@cgl.ucsf.edu <mailto:chimerax-users@cgl.ucsf.edu>> wrote:
Hi Elaine, I am writing up some results. Would you please see if I am describing our method correctly? Our protein has 2 domains, one whose structure is well known and for which many PDBs are available, and another that is novel. The PDB IDs below came from your file entitled af512_template_domain_names.json
"Protein structure was predicted with AlphaFold2 (Jumper et al., 2021) as implemented by UCSF ChimeraX v. 1.6rc (Pettersen et al., 2021) with the option to use PDB templates. In practice, this meant that the N-terminal DUF3494 domain of the protein was predicted based on several existing PDB structures (7bwy_A, 7bwx_A, 3uyu_A, 6a8k_A, 3wp9_A, 4nu3_A, 4nu3_B, 6bg8_B, 4nu2_A, 3vn3_A, 6eio_A, 4nuh_A, 7dc5_B, 5uyt_A, 5uyt_B, 5uyt_A, 5uyt_B) and the C-terminal domain was predicted de novo by AlphaFold."
Is it OK?
thank you Jim
_______________________________________________ ChimeraX-users mailing list ChimeraX-users@cgl.ucsf.edu <mailto:ChimeraX-users@cgl.ucsf.edu> Manage subscription: https://www.rbvi.ucsf.edu/mailman/listinfo/chimerax-users
participants (3)
-
Elaine Meng
-
James Raymond
-
Tom Goddard