【问题标题】:Download file from internet via R despite the popup尽管弹出窗口,但通过 R 从 Internet 下载文件
【发布时间】:2014-03-23 12:48:09
【问题描述】:

使用 R 从 Internet 下载文件很简单,has been addressed previously

我的问题是关于如何绕过似乎阻止我执行下载的弹出消息。具体来说,

download.file(url = "https://www.chicagofed.org/applications/bhc_data/bhcdata_index.cfm?DYR=2012&DQIR=4", destfile = "data/test.zip")

给了我一个小垃圾文件,而不是你想要的 18 兆字节文件,如果你去the website 并手动输入年份2012 和季度4。我怀疑问题在于,正如您手动执行的那样,弹出窗口会中断下载过程,询问是保存文件还是打开文件。有没有办法自动通过弹出窗口(即通过download.file)?

【问题讨论】:

    标签: r download popup


    【解决方案1】:

    这可以通过 Selenium 完成,请参阅 https://github.com/ropensci/RSelenium

    require(wdman)
    require(RSelenium)
    
    
    selPort <- 4444L
    fprof <- makeFirefoxProfile(list(browser.download.dir = "C:\\temp"
                                     ,  browser.download.folderList = 2L
                                     , browser.download.manager.showWhenStarting = FALSE
                                     , browser.helperApps.neverAsk.saveToDisk = "application/zip"))
    selServ <- selenium(port = selPort)
    remDr <- remoteDriver(extraCapabilities = fprof, port = selPort)
    remDr$open(silent = TRUE)
    remDr$navigate("https://www.chicagofed.org/applications/bhc_data/bhcdata_index.cfm")
    # click year 2012
    webElem <- remDr$findElement("name", "SelectedYear")
    webElems <- webElem$findChildElements("css selector", "option")
    webElems[[which(sapply(webElems, function(x){x$getElementText()}) == "2012" )]]$clickElement()
    
    # click required quarter
    
    webElem <- remDr$findElement("name", "SelectedQuarter")
    Sys.sleep(1)
    webElems <- webElem$findChildElements("css selector", "option")
    webElems[[which(sapply(webElems, function(x){x$getElementText()}) == "4th Quarter" )]]$clickElement()
    
    # click button
    
    webElem <- remDr$findElement("id", "downloadDataFile")
    webElem$clickElement()
    

    【讨论】:

      【解决方案2】:

      【讨论】:

        猜你喜欢
        • 1970-01-01
        • 2018-10-16
        • 1970-01-01
        • 1970-01-01
        • 1970-01-01
        • 1970-01-01
        • 1970-01-01
        • 2013-07-30
        • 1970-01-01
        相关资源
        最近更新 更多