Numerical Study on the Relationship between the Number of Ferritin Subunits and the Size of Outer Diameter

Ferritin is assembled from sub-chains into a spherical shape. We have found that the number of sub-chains is generally 3, 6, 8, 12, 24, 36, and the corresponding spherical volume (outer diameter) is also different. In this paper, using the protein structural data from PDB website, linear regression was car-ried out to model the relationship between sub-chain number and outer diameter of 25 ferritins by calculation and analysis, but the results were not ideal. In order to improve the model, we divided the 25 proteins into two groups by molecular evolution, followed by the establishment of a linear model. The correlation coefficient of one group was R 2 = 0.98. Based on cal-culations and modelling, we hypothesize that the new ferritin formed by the 3kx9 site-directed mutagenesis is no longer 24-chain, but a 36-chain mutant.

Advances in Bioscience and Biotechnology observed through the self-assembly of viral capsid, vault, heat shock and ferritin proteins [5]. In this paper, we specifically looked at ferritin which can be found ubiquitously in nature.
Ferritin plays a key role in iron detoxification and reserve, storing excess cellular iron as mineralized hydrous ferric oxide in their cavities. Ferritin proteinscan self-assemble into multi-subunit, nano-scale cages. They are also highly amenable through genetic and chemical modifications, which have attracted much recent attention in drug delivery studies and nanomaterial science [6]. In our previous research, site-directed mutagenesis on N-and C-termini of ferritin 3kx9 led to an increase in the volume of self-assembled protein cages [7]. In order to account for this observation, we will look into the relationship between the number of ferritin subunits and the size of its outer diameter in this paper.

Data
All protein structural data involved in this calculation was from the PDB [8] website (http://www.rcsb.org/). On the website, we identified all proteins that share more than 40% similarity to the amino acid sequence of 3kx9 provided by PDB. We found a total of 25 such proteins, including 3kx9. The 3D structure file (PDB format) and the sequence information file (fasta format) for each protein were downloaded.

Method
In our previously published paper [9], the results of the two calculation methods were compared. Due to the relatively large amount of calculation, the method based on the center of the sphere is used here. The paper [9] also gives a comparison of the calculation results of the two methods. Considering the reliability of the results, we calculated 80%, 85%, 90%, 95% of the maximum distance from the spherical center respectively. The average of the values was multiplied by 2 as the outer diameter. After obtaining the outer diameter data, we used linear regression to establish the relationship between the outer diameter of the 25 ferritin and the daughter strand.

Results
The outer diameter and the number of sub-chains of 25 ferritins are shown in Table 1. We used linear regression to establish the relationship between the outer diameter and the number of daughter strands of the 25 ferritins. Figure 2 shows the results of linear regression.
In Figure 2, the abscissa is the number of sub-chains, and the ordinate is the corresponding outer diameter. The correlation coefficient R 2 = 0.6469, indicating that the performance of the model is very limited. In particular, we can see that the outer diameter distribution of the 12 sub-chains is very large, ranging from about 140 to 210. Also, the outer diameter of the 24 sub-chains is found to be significantly lower than the expected value of the model.

Ways to Improve
We found that all 25 ferritins consisted of multiple daughter strands of an identical sequence. We used Maga [10] software to construct a phylogenetic tree for the sub-chains of 25 proteins. The results are shown in Figure 3.

The Problem That the Amino Acid Sequence Is Consistent and the Number of Sub-Chains Is Different
We found that the sequences of 3kx9 and 1s3q are identical, but the number of the two sub-chains is different. There are 24 sub-chains in 3kx9 and only 12     Interestingly, the number of sub-chains of 3kx9 is twice of 1s3q, but the volume of both is almost the same. Does this mean that the amino acid density of 3kx9 doubles that of 1s3q? This question deserves further exploration.

The Problem That the Mutant's Volume Becomes Larger
In their work, Williams and co-workers demonstrates that a mutation at a criti-  from the canonical 12-mer to a ferritin-like 24-mer under crystallization [11].
According to the size distribution curve of 36-mer ferritin (Figure 6), it may well be that an increase in diameter from 12 to 18 d•nm after site-directed mutagenesis on ferritin 3kx9 is due to a structural switch from 24-mer to 36-mer. If this speculation can be confirmed with experimental data, it introduces a new concept of mutational switch between related protein subfamilies.

Conclusion
In this paper, we discovered a close relationship between the number and type of