New Drug Development and Clinical Trial Design by Applying Genomic Information Management

Jeong-An Gim, Medical Science Research Center, College of Medicine, Korea University Guro Hospital, Seoul 08308, Korea

Young Kyung Ko, Division of Pulmonary, Allergy and Critical Care Medicine, Department of Internal Medicine, Korea University Guro Hospital, Seoul 08308, Korea

Depending on the patients’ genotype, the same drug may have different efficacies or side effects. With the cost of genomic analysis decreasing and reliability of analysis methods improving, vast amount of genomic information has been made available. Several studies in pharmacology have been based on genomic information to select the optimal drug, determine the dose, predict efficacy, and prevent side effects. This paper reviews the tissue specificity and genomic information of cancer. If the tissue specificity of cancer is low, cancer is induced in various organs based on a single gene mutation. Basket trials can be performed for carcinomas with low tissue specificity, confirming the efficacy of one drug for a single gene mutation in various carcinomas. Conversely, if the tissue specificity of cancer is high, cancer is induced in only one organ based on a single gene mutation. An umbrella trial can be performed for carcinomas with a high tissue specificity. Some drugs are effective for patients with a specific genotype. A companion diagnostic strategy that prescribes a specific drug for patients selected with a specific genotype is also reviewed. Genomic information is used in pharmacometrics to identify the relationship among pharmacokinetics, pharmacodynamics, and biomarkers of disease treatment effects. Utilizing genomic information, sophisticated clinical trials can be designed that will be better suited to the patients of specific genotypes. Genomic information also provides prospects for innovative drug development. Through proper genomic information management, factors relating to drug response and effects can be determined by selecting the appropriate data for analysis and by understanding the structure of the data. Selecting pre-processing and appropriate machine-learning libraries for use as machine-learning input features is also necessary. Professional curation of the output result is also required. Personalized medicine can be realized using a genome-based customized clinical trial design.

1. Background: Effective Clinical Trials Using Genomic Information

The human genome includes variants of DNA base sequences and epigenetic mutations, including changes in DNA methylation and histone acetylation patterns. Genomic variations, which include mutations in drug metabolism-related genes, can affect the pharmacokinetics, pharmacodynamics, efficacy, and safety of drugs [1,2]. This review describes the use of drug-related genomic information in drug development and clinical trial design. Personalized next-generation clinical trials, based on the individual genome, can be designed to maximize drug efficacy and minimize side effects.

With the rapid development of genome analysis technologies and computer performance, vast amounts of genomic information have been generated, which can be utilized in precision medicine. Representative cancer-related genomic information includes epidermal growth factor receptor (EGFR) mutation in non-small cell lung cancer (NSCLC) [3,4], ABL1 gene recombination in chronic myelogenous leukemia (CML) [5], and BRAF mutations in melanoma [6,7,8]. Genomic information related to cancer induction is available in databases, such as The Cancer Genome Atlas (TCGA) [9], Internal Cancer Genome Consortium (ICGC) [10], COSMIC [11], cBioPortal [12], OncoKB [13], MutaGene [14], and Cancer Genome Interpreter [15].

Open cancer genome data are generated using high-throughput technology. Cancer genome data published in TCGA include omics and clinical data of genomic variants, RNA-seq, and DNA methylation [9]. In cBioPortal, which is a rapid user-friendly platform, data on survival analysis related to variants, histological type, RNA-seq, or comparative analysis with methylation information are available [12]. The number of breast cancer patients is the largest in TCGA, in which variant information on 986 patients in 1097 clinical cases was reported. However, clinical data from TCGA do not include information about the drugs administered to the patients. There is no information on the effects and side effects of the prescribed drugs for each type of cancer. The effects of drugs can be estimated based on genomic information (mainly genomic variants) in individual clinical trials in “Drug Resistance” database [16] in COSMIC and CancerDR [17], and clinical trial results can be obtained from clinicaltrials.gov (accessed on 21 July 2022) [18].

The main objective of personalized medicine is to recommend treatment strategies and select drugs suitable for individuals based on their genomic information. Patient-specific clinical trials are necessary to realize the full potential of personalized medicine. Personal genomic information should be included in the eligibility criteria (EC) for clinical trials. The goal of this review is to provide indications on how to utilize genomic information in clinical trial designs and new drug developments. The points of this review can help prevent adverse drug reactions based on genetic information and find more effective patients.

2. Integrated Interpretation: Tissue Specificity and Environment of Cancer

Generally, carcinomas are caused by the accumulation of multiple genetic alteration in somatic cells, and tissue-specific frequencies of variants have been observed in various cancers. Tissue specificity of cancer is attributed to a variant of a specific cancer-related gene that causes organ-specific cancer [19,20]. Variants in cancer-related genes cause various types of carcinomas in different organs of the body. A genetic mutation in carcinomas with high tissue specificity results in the cancer occurring in a specific organ.

In carcinomas with low tissue specificity, a single gene mutation can cause carcinogenesis in various organs. The body consists of various cells, tissues, and organs, and each cell has the same genome sequence. The same genome sequence can perform diverse functions in different cells, depending on the changes in the epigenetic information and on various signaling mechanisms around the cell. Sensitivity to the cancer-causing factors also varies according to the cell type [19,21]. Thus, even in the presence of the same cancer-causing mutation, the probability of cancer occurrence may differ depending on the organ. Similarly, the tissue specificity of cancer can be explained by the representative examples listed below.

Mutations in the adenomatous polyposis coli (APC) gene are the cause of most familial adenomatous polyposis and colorectal cancer, but are rarely observed in other carcinomas [22]. Mutations in the Cadherin 1 (CDH1) gene are also a major cause of hereditary diffuse gastric cancer [23]. Mutations in the BRCA1 gene are mainly observed in carcinomas afflicting women, such as breast and ovarian cancer [24]. Typically, all patients with hairy cell leukemia (HCL) harbor variants in the BRAF gene [25]. Approximately 50% patients with melanoma and papillary thyroid carcinoma carry a variant of the BRAF gene [26,27]. On the other hand, approximately 10% of colorectal cancer patients harbor a variant of the BRAF gene [28].

In contrast, mutations in the TP53 gene confer low tissue specificity [29]. These mutations occur in most cancers, such as NSCLC, pancreatic ductal adenocarcinoma (PDAC), colorectal cancer, breast cancer, and ovarian cancer. The TP53 gene is involved in immune response and immunotherapy, and the wild-type p53 protein functions in mounting an adequate innate immune response. In cancer, mutant forms of the p53 protein act as a tumor antigen and induce a B-cell antibody response as well as a CD-8 killer T-cell response. In cancer immunotherapy, autoimmune and inflammatory responses, neurodegeneration, senescence, epigenetic instability, immune response, pathways, and therapeutic strategies targeting the TP53 gene and p53 protein have been discussed [29,30].

Tissue specificity of cancer is affected by various environmental factors such as metabolic abnormalities caused by diabetes or high blood pressure, infection (bacteria, viruses, parasites), and by immunocompetence [31,32,33]. These factors are classified as macroenvironments and microenvironments.

The tumor macroenvironment includes changes in body fat content, blood pressure, and blood sugar level caused by obesity, high blood pressure, and diabetes. A correlation between the tumor macroenvironment and the incidence of cancer has been reported. Pathologically and epidemiologically, the correlation patterns differ according to the type of carcinoma [34,35,36]. In diabetes, insulinemia and hyperglycemia are induced, initiating carcinogenesis. Insulinemia activates insulin-like growth factor signaling, and hyperglycemia supplies nutrients to the cancer cells, promoting acidification [33,36]. It also activates angiogenesis and cell proliferation signals via a chronic inflammatory response [36,37]. In obesity, an abnormal increase in the secretion of sex hormones derived from adipocytes, fibrosis of certain organs, and steatosis are also observed [38,39]. Furthermore, excessive cytokine secretion is induced owing to an abnormal inflammatory response, thereby increasing treatment resistance. Therefore, it is necessary to classify carcinomas according to the tumor macroenvironment influence.

The tumor microenvironment and oncogenic signaling are regulated by ligands that affect cell differentiation and receptors [40]. The organ-specific tissue differentiation is induced by stem cells in the adult tissues. This process is regulated by the epigenetic patterns of the cells constituting the tissues, self-renewal factors, and external factors [41]. Stem cell differentiation has different patterns depending on the tissue. Mesenchymal cells secret WNT proteins to maintain their stemness in the intestine. Epidermal interfollicular stem cells express their own WNT ligands for self-renewal [42]. Stem cells in colorectal cancers are maintained by secretion from activated myofibroblasts, whereas activated WNT-related signals accelerate cancer differentiation [43].

The cancer microenvironment can be explained by tumor heterogeneity. Cancer is a collection of malignant cells, cancer-associated fibroblasts, and tumor-associated macrophages, along with their ecosystem [44]. Cancer cells can be classified into infiltrating endothelial, hematopoietic, stromal, and other cell types, and their interactions have been studied [45,46]. Cancer cells evolved from a primary cancer, a concept known as cancer evolution [46,47]. Different cancer evolution patterns need to be observed accurately in individuals so that personalized treatment can be made available. Single-cell RNA-seq (scRNA-seq) is a technique that can be useful to better understand tumor heterogeneity. Specific gene expression levels for each cell type can be determined using scRNA-seq. Clustering and visualization techniques with dimensional reduction (t-SNE) for each cell type can be applied using scRNA-seq [48,49,50]. scRNA-seq has been applied in hepatocellular carcinoma [49], NSCLC [50], and primary breast cancer [48] and helps in the stratification and accurate classification of patients. This can maximize the sensitivity to appropriate drugs by understanding the pathways based on the status of the carcinoma.

The microenvironment of different cancers should also be considered in clinical trials. In hospitals, biopsies are performed on cancer patients, and are subjected to pathological analysis and genetic testing. Using a machine learning approach, patient information can be integrated to provide rapid and simple insights of clinical relevance. Owing to the increase in life expectancy and lack of exercise, complex variables related to chronic diseases affect the diagnosis and prognosis of cancer. Using a machine-learning approach, the patient information (genetic information, chronic disease status, and lifestyle) is pre-processed so that the machine-learning library can diagnose the disease. Then, cancer occurrence and prognosis-related factors can be processed using the machine-learning strategies, such as pattern recognition, classification, and visualization of results. Appropriate services have been suggested in clinical practice. Cancer type-wise genetic information is available in databases such as TCGA and COSMIC, and can be visually checked using cBioPortal [12], which is a user-friendly system, based on the patient information, providing an optimal treatment strategy is necessary. By analyzing these data, research and system developments can provide rapid and accurate insights for clinical decision-making.

3. Deposition, Application, and Indexing of Genomic Variation Information

In 2001, the Human Genome Project [51] resulted in the release of the human reference genome sequence, with new versions of the standard genome released in 2003, 2006, 2009, and 2013. Variants found in the standard genome and in other cancer patients have been stored in a database, with constant additions of information. Databases are used to predict cancer occurrence and to select a treatment strategy. For example, cancer genome projects such as TCGA [52] and ICGC [10] are publicly available, and the results of genome analysis for various diseases as well as cancer are deposited at National Center of Biotechnology Information (NCBI) Gene Expression Omnibus (GEO). However, these databases present challenges to clinical applications of datasets. To solve these challenges, web services that summarize the analysis tools and results related to cancer genomes have been developed. One of them, cBioPortal [12], allows users to search and analyze various cancer genome data in a user-friendly manner. Various data, such as genome, transcriptome, epigenome, and proteome data, obtained from cancer-derived tissues or cell lines, are collated and organized. These data are then annotated, curated, and indexed to allow researchers to analyze them. (Epi) Genetic changes according to cancer-derived samples, cancer-related genes, and signal transduction information can be observed, visualized, and linked to clinical information.

Distinguishing somatic and germline variants is important in identifying cancer-related variants. In germline variants, different patterns appear by race [53,54]. Therefore, for preventive medicine, it is critical to determine the proportion of cancer-related germline variants. Thus, in the 1000 Genomes Project, the ratio of variants by race was determined [55]. Associations between somatic and germline variants in several carcinomas have been confirmed using TCGA information [56,57,58]. This indicated the interaction of germline-somatic variants in tumorigenesis and assisted in understanding the mechanisms of cancer risk variants. The most representative cancer somatic variants database is COSMIC [59]. Most of the known mechanisms that induce carcinoma development include somatic variants. Since COSMIC was released in 2004, there have been rapid developments in next-generation sequencing (NGS) technology, computer analysis performance, and throughput. Although COSMIC has been modified for approximately 20 years, a reference database for appropriate comparisons with variants obtained in the clinical field is needed. Information exchange with external resources such as Ensembl, HGNC, and RefSeq is also insufficient; however, this is expected to be resolved by upgrading the annotation system [59]. The two-hit hypothesis and the optimal treatment strategy can overcome the limitations of COSMIC and maximize its advantages.

In the two-hit hypothesis proposed by Knudson in 1971, “in the dominantly inherited form, one mutation is inherited via germinal cells and the second occurs in somatic cells. In the non-hereditary form, both mutations occur in the somatic cells” [60]. There are continuing debates regarding the role of somatic and germline variants in the development of carcinoma. Whole-exome sequencing analysis of autism patients and their families revealed that the number of de novo variants in germline cells increased with age [61]. This was confirmed in a deCODE genetics study conducted in Iceland [62]. In summary, both somatic and germline variants play important roles in carcinogenesis.

An optimal treatment strategy is one that is based on the integrated information regarding the somatic and germline mutations, age, lifestyle, and the clinical information of the patient [63,64]. To develop a patient-specific somatic-germline variant-based treatment strategy, genomic information, along with patient information, must be collected longitudinally. The collection of the family history of the patient and testing for germline variants in the family are also important [65]. Recently, an integrated treatment strategy using machine learning and a prognosis prediction strategy were presented [66,67]. The efforts and databases to realize personalized medicine are described below.

The germline/somatic variant subcommittee, a multidisciplinary research committee of Clinical Genome Resource (ClinGen), was established in 2013 [68]. Somatic-germline zygosity is an algorithm for predicting the homozygous versus heterozygous variants and those of somatic versus germline origin, and was introduced by utilizing the sequence information obtained from the carcinoma samples. Modeling with allele frequency (AF), sequencing depth, tumor ploidy, and local copy number as inputs can assist in clinical decision making [69]. In another study, germline and somatic variants were analyzed and a model of cancer occurrence with age was generated, confirming that germline variants cause early-onset cancer, whereas somatic variants induce late-onset cancer [70]. For future generation of data that can be used for cancer diagnosis and clinical decisions, a two-hit strategy is needed to simultaneously analyze somatic and germline variants in tumor tissue and blood. Genomic data from various carcinomas or races have been collected that will serve as a basis for future cancer diagnosis and treatment strategies.

To select genomic variants for clinical trials, it is important to determine the ratio of variants by race. The cohort project carried out in several countries and the UK Biobank project are representative examples. Using the ratio of variants in each nation, it is possible to determine the variants crucial for the occurrence of a specific cancer. Large-scale cohort studies conducted in several countries have been listed in Table 1. Health and genetic indicators found in specific races for diseases, including cancer, can be explained using national variant data and risk factors. Rare variants showing a race-specific pattern can explain the genetic contribution to disease development, unlike common or de novo variants [71].

PharmGKB [72] and DrugBank [73] provide information relating to pharmacogenomics and the associations between known genotypes and drugs. These databases suggest that resistance and sensitivity are related to drug responses in clinical trials. Large volumes of genomic information related to drug responses have been produced and evidence for the clinical use of drugs has been presented [17,74,75].

Data on cancer-related gene expression and variants are provided in datasets such as the NCBI GEO and ArrayExpress. These data can be used preliminarily to identify variants and gene expression related to drug sensitivities and their side effects. For example, when “cancer drug sensitivity resistance” is queried in GEO, GSE102787 dataset is highlighted. Using GEO2R, researchers can select genes that are differentially expressed based on drug sensitivities. The omics information based on differences between two groups proposes the possibility of its clinical application.

To date, omics data based on the results of many clinical trials related to drug sensitivity and resistance have been published and further work is ongoing. Genomic and clinical data related to clinical trials are big data, and further processing is required to make clinical decisions under the regulation of bioethics laws. Moreover, appropriate indexing and cleaning processes for the stored and collected data are required. Thus, when a researcher uses the stored data, incorrect decisions can be prevented through excluding unnecessary or unstandardized data. In addition, data modeling and decision curation are required (Figure 1).

Table 1. Large-scale cohort cases for the realization of precision medicine by nation.

New Drug Development

Figure 1. Processing strategy for genomic and clinical data. Data collected have to be stored, indexed, and cleaned for use at a later stage. Data modeling and curation are shown for the clinical decision system. All processes must be performed under the regulation of the bioethics law.

4. Basket and Umbrella Trials

The development of genome analysis technology has enabled integrated analysis of various carcinomas. Advances in cancer research can now help to identify cancers with the shared biological mechanisms in different anatomical locations as well as cancers with different biological pathways in the same anatomical location. These advancements have transformed the paradigm that cancers originating from different anatomical organs have different biological mechanisms. Thus, the new cancer classification is based on molecular, cellular, and signaling mechanisms. Although derived from different anatomical organs, cancers with the same signaling mechanism may be subjected to the same treatment strategy. A drug modulating a specific signaling mechanisms can be applied to cancers originating from various anatomical organs that share that particular signaling mechanism; this clinical trial strategy is called a basket trial. Conversely, cancers originating from the same anatomical organ can be caused by different signaling mechanisms. Drugs that control different signaling mechanisms in the same cancer can be administered in a strategy, called an umbrella trial. These clinical trial strategies can help optimize the efficacy of new drugs (Figure 2).

Clinical Trial Design

Figure 2. Basket and umbrella trials.

In basket clinical trials, a single anticancer drug is tested against various carcinomas harboring the same genetic variant, whereas in umbrella clinical trials, several anticancer drugs are tested against a single carcinoma according to various genetic variants [84]. An example of a basket clinical trial is the phase II clinical trial of vemurafenib in 122 patients harboring a BRAF V600E mutation [85]. Vemurafenib, an inhibitor of BRAF V600 kinase, has been established to treat various carcinomas by an appropriate basket trial. Prior to the development of basket clinical trials, vemurafenib was effective in approximately 50% of patients with metastatic melanoma harboring the BRAF V600E mutation.

In various genomic information-based cancer studies, such as those based on TCGA, the BRAF V600E mutation was found in various cancers, such as NSCLC and colorectal cancer. Therefore, a basket clinical trial was conducted in patients harboring BRAF V600E mutations and cancers in tissues other than those of melanoma patients. This basket trial consisted of a total of 6 + 1 cohorts, with cohorts of patients with six types of cancer: NSCLC, ovarian cancer, colorectal cancer, cholangiocarcinoma, breast cancer, and multiple myeloma. The cancer progressed in the cohorts of patients with different types of cancer. Additionally, it progressed in patients with Erdheim–Chester disease and Langerhans cell histiocytosis. The results showed that the efficacy of vemurafenib was not the same in all cancers, with a response rate of 42% in NSCLC and 43% in Erdheim–Chester disease or Langerhans cell histiocytosis.

Unlike basket clinical trials, umbrella clinical trials test various treatment methods on the same carcinoma. Umbrella clinical trials can screen various treatment methods for a patient group or carcinoma for which there is no clear biomarker.

5. Companion Diagnosis: From the Genomics Point of View

Companion diagnosis (CDx) is a diagnostic method or diagnostic tool that is a “companion” for selecting disease-causing factors for targeted therapy [86]. Only diagnostic methods permitted by regulatory agencies can be used for specific targeted therapeutics. Clinical validity of the diagnostic and treatment methods used in CDx must be confirmed through clinical trials [87]. The CDx cases are presented in Table 2.

To use diagnostic tools for specific therapeutic agents, clinical evidence for interpreting diagnostic results must be considered [86]. Evidence-based recommendations are available for selecting drugs and clinical methods. The technical and environmental factors of different laboratories must also be correlated [88]. In 2014, the US Food and Drug Administration presented guidelines for mandatory CDx when developing targeted therapies. Similarly, in 2015, the Korean Ministry of Food and Drug Safety announced the ‘Guidelines for Approval and Review of In Vitro Companion Diagnostic Devices’.

Various factors can induce carcinoma formation, such as breast cancer, colorectal cancer, lung cancer, stomach cancer, pancreatic duct cancer, and melanoma. However, the same factors (EGFR or TP53 genes) also cause carcinoma in various organs [3,4,29,30]. Therefore, if the underlying mechanism is the same, a common therapeutic strategy can be used.

In the 1970s, the therapeutic effect of tamoxifen (Nolvadex), a breast cancer therapeutic agent, varied depending on the status of the estrogen receptor (ER) expression in patients with breast cancer. In the 1980s, it became known that the therapeutic effect on breast cancer varied depending on the HER2 gene mutation. Trastuzumab (Herceptin), a HER2 antagonist, was developed in the 1990s. As the therapeutic effect differs among patients depending on their genotype, considering patient genotype while selecting a specific drug and establishing a treatment strategy has attracted attention. In the 2000s, the research findings on signal transduction of cancer-causing factors were evaluated, and drugs that inhibit mutation-induced cancer-causing factors were developed. Representative drugs include gefitinib (Iressa) and erlotinib (Tarceva), which inhibit EGFR signaling, and imatinib (Gleevec), which is used for CML treatment [89]. These targeted anticancer agents selectively detect and inhibit specific targets expressed in the cancer cells. Therefore, the therapeutic effect is improved with reduced side effects (Figure 3).

Figure 3. Examples of basket trials in EGFR.

In the 2010s, CDx was used for the development of an immune checkpoint inhibitor. Ipilimumab (Yervoy), approved as the first immune checkpoint inhibitor in 2011, inhibits the activity of CTLA-4, which is expressed on the surface of T cells and suppresses their function. In the mid-2010s, drugs inhibiting PD-1, which plays a similar role as CTLA-4, were developed. Pembrolizumab (Keytruda) and nivolumab (Opdivo) selectively inhibit PD-1 in NSCLC and melanoma. These immune checkpoint inhibitors maximize T cell activity by inhibiting suppressing molecules, such as CTLA-4, PD-1, and PD-L1. In the case of pembrolizumab and nivolumab, health insurance is offered in Korea if patients having stage IIIB or higher disease test positive for PD-L1 expression and who have not responded to previous platinum-based chemotherapy without treatment with a PD-1 inhibitor (Figure 3).

The targeted cancer drugs discussed in this paper act only on cancer cells with specific biomarkers, and if used in individuals without the specific targets, they can cause side effects. Therefore, a process of detecting a specific target based on the patient’s genetic or clinical information is necessary. In this case, the regulatory body must approve the process of screening the specific target.

To promote CDx, each entity involved in new drug development requires that pharmaceutical companies need to personalize clinical trial designs based on the patients’ genotypes. Diagnostic companies will need to discover factors related to cancer-causing signaling from the results of basic science research, such as cellular- and molecular-level signaling mechanisms, and design a method to rapidly and accurately screen them.

Regulators will need to strengthen the supervision, direction, and guidance of efficient and safe patient-specific clinical trial designs. Health insurance entities will also need to optimize an appropriate fee for diagnosis and examination to use a specific drug and pay according to efficacy. CDx can present clinical evidence for the use of drugs for a specific target and can increase cancer treatment efficiency by applying personalized treatments to patients. Additionally, it can contribute to the financial security of the National Health Insurance by reducing the indiscriminate or incorrect use of targeted anticancer drugs.

Table 2. Cases of companion diagnosis.

6. Genomic Information and Pharmacometrics

Pharmacometrics identifies and predicts the relationship among pharmacokinetics, pharmacodynamics, biomarkers, and therapeutic properties through mathematical and statistical models. The interaction between drugs and patients is quantitatively analyzed by constructing and simulating a mathematical model to assess the effects of treatment and adverse effects according to drug concentration. This is intended to accurately identify the drug exposure–response relationship of individuals and groups by reflecting individual differences, intra-individual variability, and various errors.

In econometric pharmacology, parameters relating to pharmacology, physiology, and pathology are used. Recently, genotypes or epigenotypes have also been included as parameters. Systems pharmacology analyzes the diversity of individual drug responses by synthesizing these parameters, thus enabling a holistic approach to determine drug responses by parsing the various elements constituting individual drug responses. The systems pharmacology was developed by the following three factors: the increasing number of samples with well-analyzed patient characteristics, the development of omics technology, and the increasing analysis networks based on omics data.

Genotyping of high-throughput sequencing results obtained using DNA chips or NGS techniques is necessary for subjects participating in clinical trials. If the phenotype is considered safe and efficacious, the related genotype should be extracted, thereby elaborating on the ramifications of the patient group according to genotype. For statistical processing and machine learning analysis, patient-specific labeling should be accurately performed, and the individual characteristics of the patients should be well described. Recent evidence suggests that both genetic and epigenetic factors, such as gene expression and DNA methylation, are related to drug responses. Correlation, eQTL, and multi-omics approaches can be used to extract relevant parameters related to drug response.

Attempts to incorporate genotypes into drug response modelling are ongoing. In a clinical trial of simvastatin, modeling using seven genotypes known to be related to drug metabolism was attempted [114]. Modeling was attempted using the change in DNA methylation level caused by the EGFR inhibitor gefitinib as a parameter, and it was confirmed that epigenotypes, such as DNA methylation patterns, can also be the subject of modeling [115].

7. Challenge: Genomic Information Management

Peter Drucker said, “If you can’t measure it, you can’t manage it [116].” Currently, technologies that can measure genomic information have been developed [117]. Hospitals are accumulating patient-derived NGS data for the diagnosis and selection of appropriate drugs or treatments [118]. A system for the quality control of the measured results and the supervision of the regulatory agency on the results was established. Although the results of treatment progress along with patient-derived laboratory data are accumulated along with the genotype, it is necessary to establish a system for decision-making and selecting appropriate treatment strategies for specific diseases in actual clinical practice. An optimal management strategy that includes appropriate storage, indexing, and ethical considerations for accumulating genomic information is required. The topic of the “genomic information management” strategy requires further evaluation (Figure 4) [119].

Genomic Information

Figure 4. A comprehensive model of genomic information management with clinical data. Two omics data, gene expression and DNA methylation patterns could be changed by aging. The genomic data and clinical data of an individual are continuously collected over time. We aim to develop a model that can predict disease prediction, provide appropriate lifestyle habits, or present evidence that can be used in clinical practice by discovering genomic data that predicts changes in health status based on the collected data and applying machine learning to each data. Strategies for presenting insights based on patient-derived genomic information. Hospitals track and accumulate clinical information for chronic disease patients. Clinical information explains the maintenance of health, deterioration of the health state, and recovery of health over time. Integrate clinical and genomic information to find factors related to maintaining healthy states. The optimal combination is presented through machine learning, disease detection, lifestyle suggestion, and clinical decision basis.

The primary goal for implementing the genomic information management strategy is to obtain the necessary insights to perform research on existing public omics data (Table 3). Omics data, such as in NCBI GEO and ArrayExpress, require data analysis, visualization, and extraction insights. The web-based databases presented in Table 3 can be used, and the genome information can be appropriately managed using machine learning.

Table 3. Cases of extracting new insights through public omics data.

Machine learning is used to discover rules from data, recognize patterns, and classify them based on the characteristics of the data content [66,67]. In order for the machine-learning analysis library to recognize the data well, the structuring of the data (categorical, continuous, and ranked) and pre-processing should be performed initially. To extract factors relating to optimal clinical trials and the safety and efficacy of personalized drugs, a clear definition of the data structure is necessary. It is necessary to properly classify the genotype into categorical and phenotypic information on pharmacokinetic parameters, efficacy, and safety into categorical and continuous types and accurately predict the structure of the data to be performed.

Traditionally, hospitals have obtained clinical laboratory data and disease diagnosis results from patients. Recently, NGS-based genetic information from patients and image information, such as from PET and CT, have been stored in the hospital’s computer network in a common data model. These data are appropriate for selecting personalized medicine and patient-specific treatment strategies. However, the structure and characteristics of each data must be accurately understood and used as input features for the machine learning library. It is also necessary to select appropriate machine learning library inputs that present the optimal treatment strategy as the output. Patient information in hospitals is personal, and regulatory agencies and IRB reviewers must be confident that the research and clinical trial design protect patient privacy.

8. Perspectives and Conclusions

A large amount of genetic information can be quickly retrieved, and patient-derived clinical data can be stored in hospitals. Machine learning techniques are becoming more sophisticated for discovering combinations, recognizing patterns, and classifying clinical data. Computer performance and data storage are improving. These data can assist with developing new drugs and designing optimal clinical trials. In this review, new drug development and clinical trial designs using genomic information are discussed. The three most important points are as follows: firstly, the appropriate clinical data for analysis must be selected, and the structure of the data must be understood; second, a machine learning input feature and a machine learning library should be selected as inputs; third, appropriate curation of the output result is required.

In the future, hospitals will continue to accumulate patient-derived genomic and clinical data, and advances in computer performance and sophisticated machine learning libraries will continue. Collaborative research with research institutes and companies that can analyze the data accumulated in hospitals is necessary. Appropriate access to anonymized patient information and legal regulations and measures to protect patients’ personal information are required. Thus, patient-specific treatment will become increasingly sophisticated, the effects of treatment will increase, and the side effects of treatment will continue to decrease.

Author Contributions

Conceptualization, Y.K.K. and J.-A.G.; formal analysis, Y.K.K.; investigation, Y.K.K.; data curation, J.-A.G.; writing—original draft preparation, Y.K.K. and J.-A.G.; writing—review and editing, J.-A.G.; supervision, J.-A.G.; project administration, J.-A.G.; funding acquisition, J.-A.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare, Republic of Korea (grant number: HI21C0012), and the National Research Foundation (NRF) funded by the Ministry of Education (grant number: NRF-2020R1I1A1A01052701).

References

1.   Wang, L.; McLeod, H.L.; Weinshilboum, R.M. Genomics and drug response. N. Engl. J. Med. 2011, 364, 1144–1153. [Google Scholar] [CrossRef] [PubMed]
2.   Zanger, U.M.; Schwab, M. Cytochrome P450 enzymes in drug metabolism: Regulation of gene expression, enzyme activities, and impact of genetic variation. Pharmacol. Ther. 2013, 138, 103–141. [Google Scholar] [CrossRef] [PubMed]
3.   Jänne, P.A.; Yang, J.C.-H.; Kim, D.-W.; Planchard, D.; Ohe, Y.; Ramalingam, S.S.; Ahn, M.-J.; Kim, S.-W.; Su, W.-C.; Horn, L. AZD9291 in EGFR inhibitor–resistant non–small-cell lung cancer. N. Engl. J. Med. 2015, 372, 1689–1699. [Google Scholar] [CrossRef]
4.   Riely, G.J.; Pao, W.; Pham, D.; Li, A.R.; Rizvi, N.; Venkatraman, E.S.; Zakowski, M.F.; Kris, M.G.; Ladanyi, M.; Miller, V.A. Clinical course of patients with non–small cell lung cancer and epidermal growth factor receptor exon 19 and exon 21 mutations treated with gefitinib or erlotinib. Clin. Cancer Res. 2006, 12, 839–844. [Google Scholar] [CrossRef] [PubMed]
5.   Radich, J.P.; Kopecky, K.J.; Appelbaum, F.R.; Kamel-Reid, S.; Stock, W.; Malnassy, G.; Paietta, E.; Wadleigh, M.; Larson, R.A.; Emanuel, P. A randomized trial of dasatinib 100 mg versus imatinib 400 mg in newly diagnosed chronic-phase chronic myeloid leukemia. Blood J. Am. Soc. Hematol. 2012, 120, 3898–3905. [Google Scholar] [CrossRef] [PubMed]
6.   Grob, J.J.; Amonkar, M.M.; Karaszewska, B.; Schachter, J.; Dummer, R.; Mackiewicz, A.; Stroyakovskiy, D.; Drucis, K.; Grange, F.; Chiarion-Sileni, V. Comparison of dabrafenib and trametinib combination therapy with vemurafenib monotherapy on health-related quality of life in patients with unresectable or metastatic cutaneous BRAF Val600-mutation-positive melanoma (COMBI-v): Results of a phase 3, open-label, randomised trial. Lancet Oncol. 2015, 16, 1389–1398. [Google Scholar] [PubMed]
7.   Robert, C.; Grob, J.J.; Stroyakovskiy, D.; Karaszewska, B.; Hauschild, A.; Levchenko, E.; Chiarion Sileni, V.; Schachter, J.; Garbe, C.; Bondarenko, I. Five-year outcomes with dabrafenib plus trametinib in metastatic melanoma. N. Engl. J. Med. 2019, 381, 626–636. [Google Scholar] [CrossRef]
8.   Robert, C.; Karaszewska, B.; Schachter, J.; Rutkowski, P.; Mackiewicz, A.; Stroiakovski, D.; Lichinitser, M.; Dummer, R.; Grange, F.; Mortier, L. Improved overall survival in melanoma with combined dabrafenib and trametinib. N. Engl. J. Med. 2015, 372, 30–39. [Google Scholar] [CrossRef]
9.   Liu, J.; Lichtenberg, T.; Hoadley, K.A.; Poisson, L.M.; Lazar, A.J.; Cherniack, A.D.; Kovatich, A.J.; Benz, C.C.; Levine, D.A.; Lee, A.V. An integrated TCGA pan-cancer clinical data resource to drive high-quality survival outcome analytics. Cell 2018, 173, 400–416.e411. [Google Scholar] [CrossRef]
10.   Zhang, J.; Baran, J.; Cros, A.; Guberman, J.M.; Haider, S.; Hsu, J.; Liang, Y.; Rivkin, E.; Wang, J.; Whitty, B. International Cancer Genome Consortium Data Portal—a one-stop shop for cancer genomics data. Database 2011, 2011, bar026. [Google Scholar] [CrossRef]
11.   Bamford, S.; Dawson, E.; Forbes, S.; Clements, J.; Pettett, R.; Dogan, A.; Flanagan, A.; Teague, J.; Futreal, P.A.; Stratton, M.R. The COSMIC (Catalogue of Somatic Mutations in Cancer) database and website. Br. J. Cancer 2004, 91, 355–358. [Google Scholar] [CrossRef] [PubMed]
12.   Gao, J.; Aksoy, B.A.; Dogrusoz, U.; Dresdner, G.; Gross, B.; Sumer, S.O.; Sun, Y.; Jacobsen, A.; Sinha, R.; Larsson, E. Integrative analysis of complex cancer genomics and clinical profiles using the cBioPortal. Sci. Signal. 2013, 6, pl1. [Google Scholar] [CrossRef] [PubMed]
13.   Chakravarty, D.; Gao, J.; Phillips, S.; Kundra, R.; Zhang, H.; Wang, J.; Rudolph, J.E.; Yaeger, R.; Soumerai, T.; Nissan, M.H. OncoKB: A precision oncology knowledge base. JCO Precis. Oncol. 2017, 1, PO.17.00011. [Google Scholar] [CrossRef] [PubMed]
14.   Goncearenco, A.; Rager, S.L.; Li, M.; Sang, Q.-X.; Rogozin, I.B.; Panchenko, A.R. Exploring background mutational processes to decipher cancer genetic heterogeneity. Nucleic Acids Res. 2017, 45, W514–W522. [Google Scholar] [CrossRef]
15.   Tamborero, D.; Rubio-Perez, C.; Deu-Pons, J.; Schroeder, M.P.; Vivancos, A.; Rovira, A.; Tusquets, I.; Albanell, J.; Rodon, J.; Tabernero, J. Cancer Genome Interpreter annotates the biological and clinical relevance of tumor alterations. Genome Med. 2018, 10, 25. [Google Scholar] [CrossRef]
16.   Forbes, S.A.; Beare, D.; Boutselakis, H.; Bamford, S.; Bindal, N.; Tate, J.; Cole, C.G.; Ward, S.; Dawson, E.; Ponting, L. COSMIC: Somatic cancer genetics at high-resolution. Nucleic Acids Res. 2017, 45, D777–D783. [Google Scholar] [CrossRef]
17.   Kumar, R.; Chaudhary, K.; Gupta, S.; Singh, H.; Kumar, S.; Gautam, A.; Kapoor, P.; Raghava, G.P. CancerDR: Cancer drug resistance database. Sci. Rep. 2013, 3, 1445. [Google Scholar] [CrossRef]
18.   Zarin, D.A.; Tse, T.; Williams, R.J.; Califf, R.M.; Ide, N.C. The ClinicalTrials. gov results database—update and key issues. N. Engl. J. Med. 2011, 364, 852–860. [Google Scholar] [CrossRef]
19.   Bianchi, J.J.; Zhao, X.; Mays, J.C.; Davoli, T. Not all cancers are created equal: Tissue specificity in cancer genes and pathways. Curr. Opin. Cell Biol. 2020, 63, 135–143. [Google Scholar] [CrossRef]
20.   Schaefer, M.H.; Serrano, L. Cell type-specific properties and environment shape tissue specificity of cancer genes. Sci. Rep. 2016, 6, 20707. [Google Scholar] [CrossRef]
21.   Haigis, K.M.; Cichowski, K.; Elledge, S.J. Tissue-specificity in cancer: The rule, not the exception. Science 2019, 363, 1150–1151. [Google Scholar] [CrossRef] [PubMed]
22.   Leone, P.J.; Mankaney, G.; Sarvapelli, S.; Abushamma, S.; Lopez, R.; Cruise, M.; LaGuardia, L.; O’Malley, M.; Church, J.M.; Kalady, M.F. Endoscopic and histologic features associated with gastric cancer in familial adenomatous polyposis. Gastrointest. Endosc. 2019, 89, 961–968. [Google Scholar] [CrossRef] [PubMed]
23.   Blair, V.R.; McLeod, M.; Carneiro, F.; Coit, D.G.; D’Addario, J.L.; van Dieren, J.M.; Harris, K.L.; Hoogerbrugge, N.; Oliveira, C.; van der Post, R.S. Hereditary diffuse gastric cancer: Updated clinical practice guidelines. Lancet Oncol. 2020, 21, e386–e397. [Google Scholar] [CrossRef]
24.   Tabano, S.; Azzollini, J.; Pesenti, C.; Lovati, S.; Costanza, J.; Fontana, L.; Peissel, B.; Miozzo, M.; Manoukian, S. Analysis of BRCA1 and RAD51C promoter methylation in italian families at high-risk of breast and ovarian cancer. Cancers 2020, 12, 910. [Google Scholar] [CrossRef]
25.   Tiacci, E.; Trifonov, V.; Schiavoni, G.; Holmes, A.; Kern, W.; Martelli, M.P.; Pucciarini, A.; Bigerna, B.; Pacini, R.; Wells, V.A. BRAF mutations in hairy-cell leukemia. N. Engl. J. Med. 2011, 364, 2305–2315. [Google Scholar] [CrossRef]
26.   Ottaviano, M.; Giunta, E.F.; Tortora, M.; Curvietto, M.; Attademo, L.; Bosso, D.; Cardalesi, C.; Rosanova, M.; De Placido, P.; Pietroluongo, E. BRAF gene and melanoma: Back to the future. Int. J. Mol. Sci. 2021, 22, 3474. [Google Scholar] [CrossRef]
27.   Li, X.; Kwon, H. The Impact of BRAF mutation on the recurrence of papillary thyroid carcinoma: A meta-analysis. Cancers 2020, 12, 2056. [Google Scholar] [CrossRef]
28.   Li, Z.-N.; Zhao, L.; Yu, L.-F.; Wei, M.-J. BRAF and KRAS mutations in metastatic colorectal cancer: Future perspectives for personalized therapy. Gastroenterol. Rep. 2020, 8, 192–205. [Google Scholar] [CrossRef]
29.   Levine, A.J. P53 and the immune response: 40 years of exploration—A plan for the future. Int. J. Mol. Sci. 2020, 21, 541. [Google Scholar] [CrossRef]
30.   Malekzadeh, P.; Pasetto, A.; Robbins, P.F.; Parkhurst, M.R.; Paria, B.C.; Jia, L.; Gartner, J.J.; Hill, V.; Yu, Z.; Restifo, N.P. Neoantigen screening identifies broad TP53 mutant immunogenicity in patients with epithelial cancers. J. Clin. Investig. 2019, 129, 1109–1114. [Google Scholar] [CrossRef]
31.   Saxton, S.N.; Clark, B.J.; Withers, S.B.; Eringa, E.C.; Heagerty, A.M. Mechanistic links between obesity, diabetes, and blood pressure: Role of perivascular adipose tissue. Physiol. Rev. 2019, 99, 1701–1763. [Google Scholar] [CrossRef] [PubMed]
32.   Gillies, R.J.; Pilot, C.; Marunaka, Y.; Fais, S. Targeting acidity in cancer and diabetes. Biochim. Et Biophys. Acta (BBA)-Rev. Cancer 2019, 1871, 273–280. [Google Scholar] [CrossRef] [PubMed]
33.   Wang, M.; Yang, Y.; Liao, Z. Diabetes and cancer: Epidemiological and biological links. World J. Diabetes 2020, 11, 227. [Google Scholar] [CrossRef] [PubMed]
34.   Abdalkareem, E.A.; Yin, K.B. A Current Perspective of in Association with Colorectal Carcinogenesis. Open Infect. Dis. J. 2019, 11, 7–12. [Google Scholar] [CrossRef]
35.   Rumgay, H.; Murphy, N.; Ferrari, P.; Soerjomataram, I. Alcohol and cancer: Epidemiology and biological mechanisms. Nutrients 2021, 13, 3173. [Google Scholar] [CrossRef]
36.   Zhou, Q.; Xi, S. A review on arsenic carcinogenesis: Epidemiology, metabolism, genotoxicity and epigenetic changes. Regul. Toxicol. Pharmacol. 2018, 99, 78–88. [Google Scholar] [CrossRef]
37.   Machlowska, J.; Baj, J.; Sitarz, M.; Maciejewski, R.; Sitarz, R. Gastric cancer: Epidemiology, risk factors, classification, genomic characteristics and treatment strategies. Int. J. Mol. Sci. 2020, 21, 4012. [Google Scholar] [CrossRef]
38.   Xu, M.; Jung, X.; Hines, O.J.; Eibl, G.; Chen, Y. Obesity and pancreatic cancer: Overview of epidemiology and potential prevention by weight loss. Pancreas 2018, 47, 158. [Google Scholar] [CrossRef]
39.   Ye, P.; Xi, Y.; Huang, Z.; Xu, P. Linking obesity with colorectal cancer: Epidemiology and mechanistic insights. Cancers 2020, 12, 1408. [Google Scholar] [CrossRef]
40.   Lim, A.R.; Rathmell, W.K.; Rathmell, J.C. The tumor microenvironment as a metabolic barrier to effector T cells and immunotherapy. Elife 2020, 9, e55185. [Google Scholar] [CrossRef]
41.   Wang, X. Stem cells in tissues, organoids, and cancers. Cell. Mol. Life Sci. 2019, 76, 4043–4070. [Google Scholar] [CrossRef] [PubMed]
42.   Clevers, H.; Loh, K.M.; Nusse, R. An integral program for tissue renewal and regeneration: Wnt signaling and stem cell control. Science 2014, 346, 1248012. [Google Scholar] [CrossRef] [PubMed]
43.   Dewi, D.L.; Ishii, H.; Kano, Y.; Nishikawa, S.; Haraguchi, N.; Sakai, D.; Satoh, T.; Doki, Y.; Mori, M. Cancer stem cell theory in gastrointestinal malignancies: Recent progress and upcoming challenges. J. Gastroenterol. 2011, 46, 1145. [Google Scholar] [CrossRef] [PubMed]
44.   Hass, R.; von der Ohe, J.; Ungefroren, H. Impact of the tumor microenvironment on tumor heterogeneity and consequences for cancer cell plasticity and stemness. Cancers 2020, 12, 3716. [Google Scholar] [CrossRef]
45.   Bussard, K.M.; Mutkus, L.; Stumpf, K.; Gomez-Manzano, C.; Marini, F.C. Tumor-associated stromal cells as key contributors to the tumor microenvironment. Breast Cancer Res. 2016, 18, 84. [Google Scholar] [CrossRef]
46.   Mao, Y.; Keller, E.T.; Garfield, D.H.; Shen, K.; Wang, J. Stromal cells in tumor microenvironment and breast cancer. Cancer Metastasis Rev. 2013, 32, 303–315. [Google Scholar] [CrossRef]
47.   Burrell, R.A.; McGranahan, N.; Bartek, J.; Swanton, C. The causes and consequences of genetic heterogeneity in cancer evolution. Nature 2013, 501, 338–345. [Google Scholar] [CrossRef]
48.   Chung, W.; Eum, H.H.; Lee, H.-O.; Lee, K.-M.; Lee, H.-B.; Kim, K.-T.; Ryu, H.S.; Kim, S.; Lee, J.E.; Park, Y.H. Single-cell RNA-seq enables comprehensive tumour and immune cell profiling in primary breast cancer. Nat. Commun. 2017, 8, 15081. [Google Scholar] [CrossRef]
49.   Ho, D.W.-H.; Tsui, Y.-M.; Chan, L.-K.; Sze, K.M.-F.; Zhang, X.; Cheu, J.W.-S.; Chiu, Y.-T.; Lee, J.M.-F.; Chan, A.C.-Y.; Cheung, E.T.-Y. Single-cell RNA sequencing shows the immunosuppressive landscape and tumor heterogeneity of HBV-associated hepatocellular carcinoma. Nat. Commun. 2021, 12, 3684. [Google Scholar] [CrossRef]
50.   Wu, F.; Fan, J.; He, Y.; Xiong, A.; Yu, J.; Li, Y.; Zhang, Y.; Zhao, W.; Zhou, F.; Li, W. Single-cell profiling of tumor heterogeneity and the microenvironment in advanced non-small cell lung cancer. Nat. Commun. 2021, 12, 2540. [Google Scholar] [CrossRef]
51.   Lander, E.S.; Linton, L.M.; Birren, B.; Nusbaum, C.; Zody, M.C.; Baldwin, J.; Devon, K.; Dewar, K.; Doyle, M.; FitzHugh, W. Initial sequencing and analysis of the human genome. Nature 2001, 409, 860–921. [Google Scholar] [PubMed]
52.   Tomczak, K.; Czerwińska, P.; Wiznerowicz, M. The Cancer Genome Atlas (TCGA): An immeasurable source of knowledge. Contemp. Oncol. 2015, 19, A68. [Google Scholar] [CrossRef] [PubMed]
53.   Neuhausen, S.L. Ethnic differences in cancer risk resulting from genetic variation. Cancer Interdiscip. Int. J. Am. Cancer Soc. 1999, 86, 2575–2582. [Google Scholar]
54.   Weise, N.; Shaya, J.; Javier-Desloges, J.; Cheng, H.H.; Madlensky, L.; McKay, R.R. Disparities in germline testing among racial minorities with prostate cancer. Prostate Cancer Prostatic Dis. 2021, 1–8. [Google Scholar] [CrossRef]
55.   Siva, N. 1000 Genomes project. Nat. Biotechnol. 2008, 26, 256–257. [Google Scholar] [CrossRef]
56.   Kanchi, K.L.; Johnson, K.J.; Lu, C.; McLellan, M.D.; Leiserson, M.D.; Wendl, M.C.; Zhang, Q.; Koboldt, D.C.; Xie, M.; Kandoth, C. Integrated analysis of germline and somatic variants in ovarian cancer. Nat. Commun. 2014, 5, 3156. [Google Scholar] [CrossRef]
57.   Vosoughi, A.; Zhang, T.; Shohdy, K.S.; Vlachostergios, P.J.; Wilkes, D.C.; Bhinder, B.; Tagawa, S.T.; Nanus, D.M.; Molina, A.M.; Beltran, H. Common germline-somatic variant interactions in advanced urothelial cancer. Nat. Commun. 2020, 11, 6195. [Google Scholar] [CrossRef]
58.   Wang, Y.; Wang, C.; Zhang, J.; Zhu, M.; Zhang, X.; Li, Z.; Dai, J.; Ma, H.; Hu, Z.; Jin, G. Interaction analysis between germline susceptibility loci and somatic alterations in lung cancer. Int. J. Cancer 2018, 143, 878–885. [Google Scholar] [CrossRef]
59.   Tate, J.G.; Bamford, S.; Jubb, H.C.; Sondka, Z.; Beare, D.M.; Bindal, N.; Boutselakis, H.; Cole, C.G.; Creatore, C.; Dawson, E. COSMIC: The catalogue of somatic mutations in cancer. Nucleic Acids Res. 2019, 47, D941–D947. [Google Scholar] [CrossRef]
60.   Knudson, A.G. Mutation and cancer: Statistical study of retinoblastoma. Proc. Natl. Acad. Sci. USA 1971, 68, 820–823. [Google Scholar] [CrossRef]
61.   Sanders, S.J.; Murtha, M.T.; Gupta, A.R.; Murdoch, J.D.; Raubeson, M.J.; Willsey, A.J.; Ercan-Sencicek, A.G.; DiLullo, N.M.; Parikshak, N.N.; Stein, J.L. De novo mutations revealed by whole-exome sequencing are strongly associated with autism. Nature 2012, 485, 237–241. [Google Scholar] [CrossRef] [PubMed]
62.   Kong, A.; Frigge, M.L.; Masson, G.; Besenbacher, S.; Sulem, P.; Magnusson, G.; Gudjonsson, S.A.; Sigurdsson, A.; Jonasdottir, A.; Jonasdottir, A. Rate of de novo mutations and the importance of father’s age to disease risk. Nature 2012, 488, 471–475. [Google Scholar] [CrossRef] [PubMed]
63.   Suzuki, A.; Katoh, H.; Komura, D.; Kakiuchi, M.; Tagashira, A.; Yamamoto, S.; Tatsuno, K.; Ueda, H.; Nagae, G.; Fukuda, S. Defined lifestyle and germline factors predispose Asian populations to gastric cancer. Sci. Adv. 2020, 6, eaav9778. [Google Scholar] [CrossRef]
64.   Katoh, H.; Ishikawa, S. Lifestyles, genetics, and future perspectives on gastric cancer in east Asian populations. J. Hum. Genet. 2021, 66, 887–899. [Google Scholar] [CrossRef] [PubMed]
65.   Ngeow, J.; Eng, C. Precision medicine in heritable cancer: When somatic tumour testing and germline mutations meet. NPJ Genom. Med. 2016, 1, 15006. [Google Scholar] [CrossRef] [PubMed]
66.   Hwangbo, S.; Kim, S.I.; Kim, J.-H.; Eoh, K.J.; Lee, C.; Kim, Y.T.; Suh, D.-S.; Park, T.; Song, Y.S. Development of Machine Learning Models to Predict Platinum Sensitivity of High-Grade Serous Ovarian Carcinoma. Cancers 2021, 13, 1875. [Google Scholar] [CrossRef]
67.   Baptiste, M.; Moinuddeen, S.S.; Soliz, C.L.; Ehsan, H.; Kaneko, G. Making Sense of Genetic Information: The Promising Evolution of Clinical Stratification and Precision Oncology Using Machine Learning. Genes 2021, 12, 722. [Google Scholar] [CrossRef]
68.   Rehm, H.L.; Berg, J.S.; Brooks, L.D.; Bustamante, C.D.; Evans, J.P.; Landrum, M.J.; Ledbetter, D.H.; Maglott, D.R.; Martin, C.L.; Nussbaum, R.L. ClinGen—The clinical genome resource. N. Engl. J. Med. 2015, 372, 2235–2242. [Google Scholar] [CrossRef]
69.   Sun, J.X.; He, Y.; Sanford, E.; Montesion, M.; Frampton, G.M.; Vignot, S.; Soria, J.-C.; Ross, J.S.; Miller, V.A.; Stephens, P.J. A computational approach to distinguish somatic vs. germline origin of genomic alterations from deep sequencing of cancer specimens without a matched normal. PLoS Comput. Biol. 2018, 14, e1005965. [Google Scholar] [CrossRef]
70.   Qing, T.; Mohsen, H.; Marczyk, M.; Ye, Y.; O’Meara, T.; Zhao, H.; Townsend, J.P.; Gerstein, M.; Hatzis, C.; Kluger, Y. Germline variant burden in cancer genes correlates with age at diagnosis and somatic mutation burden. Nat. Commun. 2020, 11, 2438. [Google Scholar] [CrossRef]
71.   Tennessen, J.A.; Bigham, A.W.; O’Connor, T.D.; Fu, W.; Kenny, E.E.; Gravel, S.; McGee, S.; Do, R.; Liu, X.; Jun, G. Evolution and functional impact of rare coding variation from deep sequencing of human exomes. Science 2012, 337, 64–69. [Google Scholar] [CrossRef] [PubMed]
72.   Thorn, C.F.; Klein, T.E.; Altman, R.B. PharmGKB: The pharmacogenomics knowledge base. In Pharmacogenomics; Springer: Berlin, Germany, 2013; pp. 311–320. [Google Scholar]
73.   Wishart, D.S.; Feunang, Y.D.; Guo, A.C.; Lo, E.J.; Marcu, A.; Grant, J.R.; Sajed, T.; Johnson, D.; Li, C.; Sayeeda, Z. DrugBank 5.0: A major update to the DrugBank database for 2018. Nucleic Acids Res. 2018, 46, D1074–D1082. [Google Scholar] [CrossRef] [PubMed]
74.   Madhukar, N.S.; Elemento, O. Bioinformatics approaches to predict drug responses from genomic sequencing. Cancer Syst. Biol. 2018, 1711, 277–296. [Google Scholar]
75.   Mak, A.C.; White, M.J.; Eckalbar, W.L.; Szpiech, Z.A.; Oh, S.S.; Pino-Yanes, M.; Hu, D.; Goddard, P.; Huntsman, S.; Galanter, J. Whole-genome sequencing of pharmacogenetic drug response in racially diverse children with asthma. Am. J. Respir. Crit. Care Med. 2018, 197, 1552–1564. [Google Scholar] [CrossRef]
76.   Sankar, P.L.; Parker, L.S. The Precision Medicine Initiative’s All of Us Research Program: An agenda for research on its ethical, legal, and social issues. Genet. Med. 2017, 19, 743–750. [Google Scholar] [CrossRef] [PubMed]
77.   Carrasco-Ramiro, F.; Peiró-Pastor, R.; Aguado, B. Human genomics projects and precision medicine. Gene Ther. 2017, 24, 551–561. [Google Scholar] [CrossRef] [PubMed]
78.   Gudbjartsson, D.F.; Helgason, H.; Gudjonsson, S.A.; Zink, F.; Oddson, A.; Gylfason, A.; Besenbacher, S.; Magnusson, G.; Halldorsson, B.V.; Hjartarson, E. Large-scale whole-genome sequencing of the Icelandic population. Nat. Genet. 2015, 47, 435–444. [Google Scholar] [CrossRef]
79.   Ritari, J.; Hyvärinen, K.; Clancy, J.; FinnGen; Partanen, J.; Koskela, S. Increasing accuracy of HLA imputation by a population-specific reference panel in a FinnGen biobank cohort. NAR Genom. Bioinform. 2020, 2, lqaa030. [Google Scholar] [CrossRef]
80.   Seong-Jin, P.; Cho, K.H.; Daeyeon, K.; Jeon, Y.J.; Jin, T.E.; Choe, Y.K. Korean Bio-resource Information System (KOBIS): The Nationwide Infrastructure for Collecting and Integrating Biological Resource Information in Korea. Biodivers. Inf. Sci. Stand. 2018, 2, e26286. [Google Scholar]
81.   Yun, Y.; Kim, H.-N.; Kim, S.E.; Heo, S.G.; Chang, Y.; Ryu, S.; Shin, H.; Kim, H.-L. Comparative analysis of gut microbiota associated with body mass index in a large Korean cohort. BMC Microbiol. 2017, 17, 151. [Google Scholar] [CrossRef]
82.   Boomsma, D.I.; Wijmenga, C.; Slagboom, E.P.; Swertz, M.A.; Karssen, L.C.; Abdellaoui, A.; Ye, K.; Guryev, V.; Vermaat, M.; Van Dijk, F. The Genome of the Netherlands: Design, and project goals. Eur. J. Hum. Genet. 2014, 22, 221–227. [Google Scholar] [CrossRef] [PubMed]
83.   Teo, Y.-Y.; Sim, X.; Ong, R.T.; Tan, A.K.; Chen, J.; Tantoso, E.; Small, K.S.; Ku, C.-S.; Lee, E.J.; Seielstad, M. Singapore Genome Variation Project: A haplotype map of three Southeast Asian populations. Genome Res. 2009, 19, 2154–2162. [Google Scholar] [CrossRef] [PubMed]
84.   Herbst, R.S.; Gandara, D.R.; Hirsch, F.R.; Redman, M.W.; LeBlanc, M.; Mack, P.C.; Schwartz, L.H.; Vokes, E.; Ramalingam, S.S.; Bradley, J.D. Lung Master Protocol (Lung-MAP)—A biomarker-driven protocol for accelerating development of therapies for squamous cell lung cancer: SWOG S1400. Clin. Cancer Res. 2015, 21, 1514–1524. [Google Scholar] [CrossRef] [PubMed]
85.   Hyman, D.M.; Puzanov, I.; Subbiah, V.; Faris, J.E.; Chau, I.; Blay, J.-Y.; Wolf, J.; Raje, N.S.; Diamond, E.L.; Hollebecque, A. Vemurafenib in multiple nonmelanoma cancers with BRAF V600 mutations. N. Engl. J. Med. 2015, 373, 726–736. [Google Scholar] [CrossRef]
86.   Garralda, E.; Dienstmann, R.; Piris-Giménez, A.; Braña, I.; Rodon, J.; Tabernero, J. New clinical trial designs in the era of precision medicine. Mol. Oncol. 2019, 13, 549–557. [Google Scholar] [CrossRef]
87.   Kraus, V.B. Biomarkers as drug development tools: Discovery, validation, qualification and use. Nat. Rev. Rheumatol. 2018, 14, 354–362. [Google Scholar] [CrossRef]
88.   Pant, S.; Weiner, R.; Marton, M.J. Navigating the rapids: The development of regulated next-generation sequencing-based clinical trial assays and companion diagnostics. Front. Oncol. 2014, 4, 78. [Google Scholar] [CrossRef]
89.   Jackson, S.E.; Chester, J.D. Personalised cancer medicine. Int. J. Cancer 2015, 137, 262–266. [Google Scholar] [CrossRef]
90.   Camidge, D.R.; Kim, E.E.; Usari, T.; Polli, A.; Lewis, I.; Wilner, K.D. Renal Effects of Crizotinib in Patients With ALK-Positive Advanced NSCLC. J. Thorac. Oncol. 2019, 14, 1077–1085. [Google Scholar] [CrossRef]
91.   Lin, Y.-T.; Wang, Y.-F.; Yang, J.C.-H.; Yu, C.-J.; Wu, S.-G.; Shih, J.-Y.; Yang, P.-C. Development of renal cysts after crizotinib treatment in advanced ALK-positive non–small-cell lung cancer. J. Thorac. Oncol. 2014, 9, 1720–1725. [Google Scholar] [CrossRef]
92.   de Wit, R.; de Bono, J.; Sternberg, C.N.; Fizazi, K.; Tombal, B.; Wülfing, C.; Kramer, G.; Eymard, J.-C.; Bamias, A.; Carles, J. Cabazitaxel versus abiraterone or enzalutamide in metastatic prostate cancer. N. Engl. J. Med. 2019, 381, 2506–2518. [Google Scholar] [CrossRef] [PubMed]
93.   Ballotta, L.; Zinzani, P.L.; Pileri, S.; Bruna, R.; Tani, M.; Casadei, B.; Tabanelli, V.; Volpetti, S.; Luminari, S.; Corradini, P. Venetoclax Shows Low Therapeutic Activity in BCL2-Positive Relapsed/Refractory Peripheral T-Cell Lymphoma: A Phase 2 Study of the Fondazione Italiana Linfomi. Front. Oncol. 2021, 11, 789891. [Google Scholar] [CrossRef] [PubMed]
94.   Mahadevan, D.; Cooke, L.; Riley, C.; Swart, R.; Simons, B.; Della Croce, K.; Wisner, L.; Iorio, M.; Shakalya, K.; Garewal, H. A novel tyrosine kinase switch is a mechanism of imatinib resistance in gastrointestinal stromal tumors. Oncogene 2007, 26, 3909–3919. [Google Scholar] [CrossRef] [PubMed]
95.   Baxter, E.J.; Kulkarni, S.; Vizmanos, J.L.; Jaju, R.; Martinelli, G.; Testoni, N.; Hughes, G.; Salamanchuk, Z.; Calasanz, M.J.; Lahortiga, I. Novel translocations that disrupt the platelet-derived growth factor receptor β (PDGFRB) gene in BCR–ABL-negative chronic myeloproliferative disorders. Br. J. Haematol. 2003, 120, 251–256. [Google Scholar] [CrossRef]
96.   Gelmon, K.A.; Fasching, P.A.; Couch, F.J.; Balmaña, J.; Delaloge, S.; Labidi-Galy, I.; Bennett, J.; McCutcheon, S.; Walker, G.; O’Shaughnessy, J. Clinical effectiveness of olaparib monotherapy in germline BRCA-mutated, HER2-negative metastatic breast cancer in a real-world setting: Phase IIIb LUCY interim analysis. Eur. J. Cancer 2021, 152, 68–77. [Google Scholar] [CrossRef]
97.   Haag, G.; Zoernig, I.; Hassel, J.; Halama, N.; Dick, J.; Lang, N.; Podola, L.; Funk, J.; Ziegelmeier, C.; Juenger, S. Phase II trial of ipilimumab in melanoma patients with preexisting humoural immune response to NY-ESO-1. Eur. J. Cancer 2018, 90, 122–129. [Google Scholar] [CrossRef]
98.   Francis, P.A.; Regan, M.M.; Fleming, G.F.; Láng, I.; Ciruelos, E.; Bellet, M.; Bonnefoi, H.R.; Climent, M.A.; Da Prada, G.A.; Burstein, H.J. Adjuvant ovarian suppression in premenopausal breast cancer. N. Engl. J. Med. 2015, 372, 436–446. [Google Scholar] [CrossRef]
99.   Elledge, R.M.; Green, S.; Pugh, R.; Allred, D.C.; Clark, G.M.; Hill, J.; Ravdin, P.; Martino, S.; Osborne, C.K. Estrogen receptor (ER) and progesterone receptor (PgR), by ligand-binding assay compared with ER, PgR and pS2, by immuno-histochemistry in predicting response to tamoxifen in metastatic breast cancer: A Southwest Oncology Group study. Int. J. Cancer 2000, 89, 111–117. [Google Scholar] [CrossRef]
100.   Cortés, J.; Diéras, V.; Lorenzen, S.; Montemurro, F.; Riera-Knorrenschild, J.; Thuss-Patience, P.; Allegrini, G.; De Laurentiis, M.; Lohrisch, C.; Oravcová, E. Efficacy and safety of Trastuzumab Emtansine plus Capecitabine vs Trastuzumab Emtansine alone in patients with previously treated ERBB2 (HER2)-positive metastatic breast Cancer: A phase 1 and randomized phase 2 trial. JAMA Oncol. 2020, 6, 1203–1209. [Google Scholar] [CrossRef]
101.   Facchinetti, F.; Hollebecque, A.; Bahleda, R.; Loriot, Y.; Olaussen, K.A.; Massard, C.; Friboulet, L. Facts and new hopes on selective FGFR inhibitors in solid tumors. Clin. Cancer Res. 2020, 26, 764–774. [Google Scholar] [CrossRef]
102.   Ma, J.; Zhao, S.; Qiao, X.; Knight, T.; Edwards, H.; Polin, L.; Kushner, J.; Dzinic, S.H.; White, K.; Wang, G. Inhibition of Bcl-2 synergistically enhances the antileukemic activity of midostaurin and gilteritinib in preclinical models of FLT3-mutated acute myeloid leukemia. Clin. Cancer Res. 2019, 25, 6815–6826. [Google Scholar] [CrossRef] [PubMed]
103.   Stein, E.M.; DiNardo, C.D.; Fathi, A.T.; Mims, A.S.; Pratz, K.W.; Savona, M.R.; Stein, A.S.; Stone, R.M.; Winer, E.S.; Seet, C.S. Ivosidenib or enasidenib combined with intensive chemotherapy in patients with newly diagnosed AML: A phase 1 study. Blood J. Am. Soc. Hematol. 2021, 137, 1792–1803. [Google Scholar] [CrossRef]
104.   Shaw, A.T.; Ou, S.-H.I.; Bang, Y.-J.; Camidge, D.R.; Solomon, B.J.; Salgia, R.; Riely, G.J.; Varella-Garcia, M.; Shapiro, G.I.; Costa, D.B. Crizotinib in ROS1-rearranged non–small-cell lung cancer. N. Engl. J. Med. 2014, 371, 1963–1971. [Google Scholar] [CrossRef] [PubMed]
105.   Mendonça Gorgulho, C.; Krishnamurthy, A.; Lanzi, A.; Galon, J.; Housseau, F.; Kaneno, R.; Lotze, M.T. Gutting it Out: Developing Effective Immunotherapies for Patients With Colorectal Cancer. J. Immunother. 2021, 44, 49–62. [Google Scholar] [CrossRef] [PubMed]
106.   Overman, M.J.; McDermott, R.; Leach, J.L.; Lonardi, S.; Lenz, H.-J.; Morse, M.A.; Desai, J.; Hill, A.; Axelson, M.; Moss, R.A. Nivolumab in patients with metastatic DNA mismatch repair-deficient or microsatellite instability-high colorectal cancer (CheckMate 142): An open-label, multicentre, phase 2 study. Lancet Oncol. 2017, 18, 1182–1191. [Google Scholar] [CrossRef]
107.   Hong, D.S.; DuBois, S.G.; Kummar, S.; Farago, A.F.; Albert, C.M.; Rohrberg, K.S.; van Tilburg, C.M.; Nagasubramanian, R.; Berlin, J.D.; Federman, N. Larotrectinib in patients with TRK fusion-positive solid tumours: A pooled analysis of three phase 1/2 clinical trials. Lancet Oncol. 2020, 21, 531–540. [Google Scholar] [CrossRef]
108.   André, F.; Ciruelos, E.; Rubovszky, G.; Campone, M.; Loibl, S.; Rugo, H.S.; Iwata, H.; Conte, P.; Mayer, I.A.; Kaufman, B. Alpelisib for PIK3CA-mutated, hormone receptor–positive advanced breast cancer. N. Engl. J. Med. 2019, 380, 1929–1940. [Google Scholar] [CrossRef]
109.   Dreyling, M.; Morschhauser, F.; Bouabdallah, K.; Bron, D.; Cunningham, D.; Assouline, S.; Verhoef, G.; Linton, K.; Thieblemont, C.; Vitolo, U. Phase II study of copanlisib, a PI3K inhibitor, in relapsed or refractory, indolent or aggressive lymphoma. Ann. Oncol. 2017, 28, 2169–2178. [Google Scholar] [CrossRef]
110.   Flinn, I.W.; O’Brien, S.; Kahl, B.; Patel, M.; Oki, Y.; Foss, F.F.; Porcu, P.; Jones, J.; Burger, J.A.; Jain, N. Duvelisib, a novel oral dual inhibitor of PI3K-δ, γ, is clinically active in advanced hematologic malignancies. Blood J. Am. Soc. Hematol. 2018, 131, 877–887. [Google Scholar] [CrossRef]
111.   Dmello, R.S.; To, S.Q.; Chand, A.L. Therapeutic Targeting of the Tumour Microenvironment in Metastatic Colorectal Cancer. Int. J. Mol. Sci. 2021, 22, 2067. [Google Scholar] [CrossRef]
112.   Drilon, A.; Oxnard, G.R.; Tan, D.S.; Loong, H.H.; Johnson, M.; Gainor, J.; McCoach, C.E.; Gautschi, O.; Besse, B.; Cho, B.C. Efficacy of selpercatinib in RET fusion–positive non–small-cell lung cancer. N. Engl. J. Med. 2020, 383, 813–824. [Google Scholar] [CrossRef] [PubMed]
113.   Melosky, B.; Wheatley-Price, P.; Juergens, R.A.; Sacher, A.; Leighl, N.B.; Tsao, M.-S.; Cheema, P.; Snow, S.; Liu, G.; Card, P.B. The Rapidly Evolving Landscape of Novel Targeted Therapies in Advanced Non-Small Cell Lung Cancer. Lung Cancer 2021, 160, 136–151. [Google Scholar] [CrossRef] [PubMed]
114.   Tsamandouras, N.; Dickinson, G.; Guo, Y.; Hall, S.; Rostami-Hodjegan, A.; Galetin, A.; Aarons, L. Identification of the effect of multiple polymorphisms on the pharmacokinetics of simvastatin and simvastatin acid using a population-modeling approach. Clin. Pharmacol. Ther. 2014, 96, 90–100. [Google Scholar] [CrossRef] [PubMed]
115.   Iyengar, R.; Zhao, S.; Chung, S.W.; Mager, D.E.; Gallo, J.M. Merging systems biology with pharmacodynamics. Sci. Transl. Med. 2012, 4, 126ps127. [Google Scholar] [CrossRef]
116.   Nierman, D.M. Tools that we use: If you can’t measure it, you can’t manage it. Crit. Care Med. 2007, 35, 312–313. [Google Scholar] [CrossRef]
117.   García, S.A.; Reyes Román, J.F.; Casamayor, J.C.; Pastor, O. Towards an effective and efficient management of genome data: An information systems engineering perspective. In Proceedings of the International Conference on Advanced Information Systems Engineering, Rome, Italy, 3–7 June 2019; pp. 99–110. [Google Scholar]
118.   Duyzend, M.H. Genomic medicine in a community hospital setting. J. Pediatr. 2021, 239, 1–4. [Google Scholar] [CrossRef]
119.   Gim, J.-A. A Genomic Information Management System for Maintaining Healthy Genomic States and Application of Genomic Big Data in Clinical Research. Int. J. Mol. Sci. 2022, 23, 5963. [Google Scholar] [CrossRef]
120.   Bianchi, V.; Colantoni, A.; Calderone, A.; Ausiello, G.; Ferre, F.; Helmer-Citterich, M. DBATE: Database of alternative transcripts expression. Database 2013, 2013, bat050. [Google Scholar] [CrossRef]
121.   Baek, S.-J.; Yang, S.; Kang, T.-W.; Park, S.-M.; Kim, Y.S.; Kim, S.-Y. MENT: Methylation and expression database of normal and tumor tissues. Gene 2013, 518, 194–200. [Google Scholar] [CrossRef]
122.   Feng, C.; Araki, M.; Kunimoto, R.; Tamon, A.; Makiguchi, H.; Niijima, S.; Tsujimoto, G.; Okuno, Y. GEM-TREND: A web tool for gene expression data mining toward relevant network discovery. BMC Genom. 2009, 10, 411. [Google Scholar] [CrossRef]
123.   Reibe, S.; Hjorth, M.; Febbraio, M.A.; Whitham, M. GeneXX: An online tool for the exploration of transcript changes in skeletal muscle associated with exercise. Physiol. Genom. 2018, 50, 376–384. [Google Scholar] [CrossRef] [PubMed]
124.   Canela-Xandri, O.; Rawlik, K.; Tenesa, A. An atlas of genetic associations in UK Biobank. Nat. Genet. 2018, 50, 1593–1599. [Google Scholar] [CrossRef] [PubMed]
125.   Yang, Y.; Sui, Y.; Xie, B.; Qu, H.; Fang, X. GliomaDB: A web server for integrating glioma omics data and interactive analysis. Genom. Proteom. Bioinform. 2019, 17, 465–471. [Google Scholar] [CrossRef] [PubMed]
126.   Pillon, N.J.; Gabriel, B.M.; Dollet, L.; Smith, J.A.; Puig, L.S.; Botella, J.; Bishop, D.J.; Krook, A.; Zierath, J.R. Transcriptomic profiling of skeletal muscle adaptations to exercise and inactivity. Nat. Commun. 2020, 11, 470. [Google Scholar] [CrossRef]
127.   Lee, J.; Choi, C. Oncopression: Gene expression compendium for cancer with matched normal tissues. Bioinformatics 2017, 33, 2068–2070. [Google Scholar] [CrossRef]
128.   Ono, H.; Ogasawara, O.; Okubo, K.; Bono, H. RefEx, a reference gene expression dataset as a web tool for the functional analysis of genes. Sci. Data 2017, 4, 170105. [Google Scholar] [CrossRef]
129.   Chen, G.; Ramírez, J.C.; Deng, N.; Qiu, X.; Wu, C.; Zheng, W.J.; Wu, H. Restructured GEO: Restructuring Gene Expression Omnibus metadata for genome dynamics analysis. Database 2019, 2019, bay145. [Google Scholar] [CrossRef]

Source: https://www.mdpi.com/1999-4923/14/8/1539/html

Medical Science Research Center, College of Medicine, Korea University Guro Hospital, Seoul 08308, Korea

Division of Pulmonary, Allergy and Critical Care Medicine, Department of Internal Medicine, Korea University Guro Hospital, Seoul 08308, Korea

Clarivate - Best practices in toxicology report

Bend Bioscience - The Development-GMP Gap

Clarivate - Companies to watch protein degraders report

Clarivate - Emerging Degrader Modalities on-demand webinar