【发布时间】:2025-12-02 01:40:01
【问题描述】:
我如何编写一个 R 函数,它可以采用两个字符串向量并返回常用词的数量以及比较从 stringvec1 的元素 1 到 stringvec2 的元素 1、stringvec1 的元素 2 到 stringvec2 的元素 2 等的常用词的数量。
假设我有这些数据:
#string vector 1
strvec1 <- c("Griffin Rahea Petersen Deana Franks Morgan","Story Keisha","Douglas Landon Lark","Kinsman Megan Thrall Michael Michels Breann","Gutierrez Mccoy Tyler West* Grayson Swank Shirley Didas Moriah")
#string vector 2
strvec2 <- c("Griffin Morgan Rose Manuel","Van De Grift Sarah Sell William","Mark Landon Lark","Beerman Carlee Megan Thrall Michels","Mcmillan Tyler Jonathan West* Grayson Didas Lloyd Connor")
理想情况下,我有一个函数可以返回常用词的数量以及常用词是什么:
#Non working sample of how functions would ideally work
desiredfunction_numwords(strvec1,strvec2)
[1] 2 0 2 3 4
desiredfunction_matchwords(strvec1,strvec2)
[1] "Griffin Morgan" "" "Landon Lark" "Megan Thrall Michels" "Tyler West* Grayson Didas"
【问题讨论】:
标签: r string substring string-matching longest-substring