[Posted]: 2021-08-15 23:16:19
[Question]:
I have a table that looks like this:
df1 <- data.frame(
  "seqid" = c("12", "12", "13", "12", "12", "15"),
  "source" = c("star", "star", "star", "star", "star", "star"),
  "type" = c("CDS", "CDS", "CDS", "intron", "CDS", "intron"),
  "start" = c("15", "21", "23", "35", "45", "60"),
  "end" = c("70", "80", "86", "45", "67", "88"),
  "attributes" = c("ENSOCUT00000011013", "ENSOCUT00000064484",
                   "ENSOCUT00000013302", "ENSOCUT00000010968",
                   "ENSOCUT00000010968", "ENSOCUT00000060283"),
  stringsAsFactors = FALSE, check.names = FALSE)
| seqid | source | type | start | end | attributes |
|---|---|---|---|---|---|
| 12 | star | CDS | 15 | 70 | ENSOCUT00000011013 |
| 12 | star | CDS | 21 | 80 | ENSOCUT00000064484 |
| 12 | star | CDS | 23 | 86 | ENSOCUT00000013302 |
| 12 | star | intron | 35 | 45 | ENSOCUT00000010968 |
| 12 | star | CDS | 45 | 67 | ENSOCUT00000010968 |
| 12 | star | intron | 60 | 88 | ENSOCUT00000060283 |
I only want to extract rows 1, 2, 3, and 5, so that the final result looks like this:
| seqid | source | type | start | end | attributes |
|---|---|---|---|---|---|
| 12 | star | CDS | 15 | 70 | ENSOCUT00000011013 |
| 12 | star | CDS | 21 | 80 | ENSOCUT00000064484 |
| 12 | star | CDS | 23 | 86 | ENSOCUT00000013302 |
| 12 | star | CDS | 45 | 67 | ENSOCUT00000010968 |
[Discussion]:
- Do you need df1[df1$type == 'CDS', ] here?
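To make the suggestion in this comment concrete, here is a minimal sketch of filtering by row type with base R logical indexing. It assumes the column is named type in lowercase, as in the data.frame() call in the question (R column names are case-sensitive), and the variable name cds is only illustrative.

```r
# Rebuild the example data frame from the question.
df1 <- data.frame(
  seqid = c("12", "12", "13", "12", "12", "15"),
  source = rep("star", 6),
  type = c("CDS", "CDS", "CDS", "intron", "CDS", "intron"),
  start = c("15", "21", "23", "35", "45", "60"),
  end = c("70", "80", "86", "45", "67", "88"),
  attributes = c("ENSOCUT00000011013", "ENSOCUT00000064484",
                 "ENSOCUT00000013302", "ENSOCUT00000010968",
                 "ENSOCUT00000010968", "ENSOCUT00000060283"),
  stringsAsFactors = FALSE, check.names = FALSE
)

# Keep only the rows whose type is "CDS".
# The trailing comma inside [ ] means "all columns".
cds <- df1[df1$type == "CDS", ]

# Row names carry over from df1, so they show which original rows were kept.
print(rownames(cds))
print(cds)
```

With this data the filter keeps the four CDS rows, which are rows 1, 2, 3, and 5 of the original table. Note that start and end are stored as character strings here, so they would need as.numeric() before any numeric comparison.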