【问题标题】:Extract only rows from a specific dataset in R?仅从 R 中的特定数据集中提取行?
【发布时间】:2021-08-15 23:16:19
【问题描述】:

我有一个如下所示的表格:

df1 <- data.frame(
    "seqid" = c("12", "12", "13", "12", "12", "15"),
    "source" = c("star", "star", "star", "star", "star", "star"), 
    "type" = c("CDS", "CDS", "CDS", "intron", "CDS", "intron"),
    "start" = c("15", "21", "23", "35", "45", "60"), 
    "end" = c("70", "80", "86", "45", "67", "88"),
     "attributes" = c("ENSOCUT00000011013", "ENSOCUT00000064484", 
      "ENSOCUT00000013302", 
     "ENSOCUT00000010968", "ENSOCUT00000010968", "ENSOCUT00000060283"),
    stringsAsFactors = F,check.names=FALSE)
seqid source Type start end attributes
12 star CDS 15 70 ENSOCUT00000011013
12 star CDS 21 80 ENSOCUT00000064484
12 star CDS 23 86 ENSOCUT00000013302
12 star intron 35 45 ENSOCUT00000010968
12 star CDS 45 67 ENSOCUT00000010968
12 star intron 60 88 ENSOCUT00000060283

我只想提取第 1、2、3、5 行,最终结果如下所示:

seqid source Type start end attributes
12 star CDS 15 70 ENSOCUT00000011013
12 star CDS 21 80 ENSOCUT00000064484
12 star CDS 23 86 ENSOCUT00000013302
12 star CDS 45 67 ENSOCUT00000010968

【问题讨论】:

  • 这里需要df1[df1$Type == 'CDS',]吗?

标签: r sorting row


【解决方案1】:
df1[c(1,2,3,5),]

通常在data.frame df的括号中选择编号的行/列:

df[rows_selected_go_here, columns_selected_go_here]

【讨论】:

    【解决方案2】:

    我假设您只想要来自df 的那些条目,其中Type(字符串)等于CDS

    library(tidyverse)
    df <- mutate(df, TOBINCL= 0)
    df$TOBINCL[grepl("^CDS$", df$Type, ignore.case = TRUE)] <- 1
    mynewdf <- df[(df$TOBINCL==1) , ]
    

    【讨论】:

      猜你喜欢
      • 1970-01-01
      • 1970-01-01
      • 2022-01-17
      • 1970-01-01
      • 2022-12-17
      • 2017-01-27
      • 2015-12-12
      • 2011-11-19
      • 1970-01-01
      相关资源
      最近更新 更多