Archives

  • 2018-07
  • 2018-10
  • 2018-11
  • 2019-04
  • 2019-05
  • 2019-06
  • 2019-07
  • 2019-08
  • 2019-09
  • 2019-10
  • 2019-11
  • 2019-12
  • 2020-01
  • 2020-02
  • 2020-03
  • 2020-04
  • 2020-05
  • 2020-06
  • 2020-07
  • 2020-08
  • 2020-09
  • 2020-10
  • 2020-11
  • 2020-12
  • 2021-01
  • 2021-02
  • 2021-03
  • 2021-04
  • 2021-05
  • 2021-06
  • 2021-07
  • 2021-08
  • 2021-09
  • 2021-10
  • 2021-11
  • 2021-12
  • 2022-01
  • 2022-02
  • 2022-03
  • 2022-04
  • 2022-05
  • 2022-06
  • 2022-07
  • 2022-08
  • 2022-09
  • 2022-10
  • 2022-11
  • 2022-12
  • 2023-01
  • 2023-02
  • 2023-03
  • 2023-04
  • 2023-05
  • 2023-06
  • 2023-07
  • 2023-08
  • 2023-09
  • 2023-10
  • 2023-11
  • 2023-12
  • 2024-01
  • 2024-02
  • 2024-03
  • 2024-04
  • No G C pattern managed

    2022-04-01

    No G4C pattern managed to capture all GQ forming sequences of the reference dataset. The sequences that were missed by even the most flexible models were AGATGGAGTGGAGAGG, AGGAGATGCAGGAG, AGAGGGTAGATGG and TTTTTAAAAGAAGGGGAGGAATAGGGGATATGA. In fact, the former three are not expected to form any unimolecular G-quadruplex since they consist of only two G-tracts. Instead bimolecular assembly of two separate strands would be possible and explain the in vitro observation of GQ formation by these sequences. In that regard, it BMS 195614 receptor should be noted that G4C is suitable for only unimolecular G-quadruplex prediction since at least four consecutive G-tracts are necessary for detection. The latter of the four sequences could have been detected if the minimum loop length parameter was set to 0 instead of 1, in which the loop would consist of only a phosphate between two G-tracts. Since such a short loop should be regarded as an extreme case, it would be suitable to set only the extreme loop length minimum to 0. All the suggestions for different applications are based on the evaluation of the models using the reference dataset. However, this set is not a good representation of biological DNA or RNA sequences, yet it is the most extensive and the best characterised dataset available. For better evaluation of the algorithms larger collections of experimentally validated sequences are needed to extend, especially the negative set. Fortunately, recent developments provided new datasets in regard to GQ formation along the genome. To provide an alternative evaluation, we also compared the predictions of the G4C and G4H algorithms against experimentally observed G-quadruplex-forming regions in the whole human genome. The chosen model, G3 + E3 + I1B, showed improvement over G4H for optimal parameters. More importantly, unlike G4H, the choice of discrete parameters for G4C had less dramatic effect on the score. Unfortunately, none of the algorithms managed to show high accuracy or an F1-score above 0.65 for the experimentally validated G-quadruplex-forming genomic regions, leaving potential for further development.
    Acknowledgements This work was supported by The Scientific and Technological Research Council of Turkey (TUBITAK) [grant number 216S854].
    Introduction The G-rich single strand DNAs can form polymorphic G-quadruplex structures containing G-quartets stabilized by Hoogsteen hydrogen bonding.1, 2 The whole-genome sequencing experiments and bioinformatics research predict the G-quadruplex forming sequences in the region of human genome such as telomeres, as well as in promoter regions of oncogenes.3, 4 It is believed that the G-quadruplex DNAs play important roles as regulatory elements in many biological processes, especially in regulating gene transcription and translation, and then affected cell proliferation and cancer progression.5, 6, 7 However, up to date, the structures and functions of most G-quadruplex DNAs in genomes are difficult to predict purely. In this context, detection of G-quadruplex DNA structures both in vitro and in vivo is crucial for understanding of their persistence and biological roles. Fluorescence techniques provide the additional advantage of real-time monitoring of structure and biological functions of biomacromolecules in living cells, with high temporal and spatial resolution. There has been a substantial amount of effort in developing the small molecular fluorescence “light-up” probes to assess the formation and topology of G-quadruplex DNA structures.9, 10, 11, 12 In most cases, these probes like arylvinyl dyes are mostly nonfluorescent in solution alone, due to the rapid nonradiative decay which will be suppressed and give large fluorescence enhancement when binding with G-quadruplex DNAs, for example, through restriction of intramolecular rotation. Since the intramolecular movements of the probes may also be restricted by other nonG-quadruplex DNA forms (e.g. duplex), which will lead to a very poor selectivity. Thus, probes that are capable of detection of G-quadruplex DNAs with a sufficient selectivity over other DNA forms are high desired.