【Question title】: VBA xmlhttp GET - getting data from table with irregular structure
【Posted】: 2016-11-28 00:36:44
【Question】:

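I am trying to fetch data from a website with an xmlhttp GET request. Unfortunately, the table does not have a constant number of cells per row, because some cells are merged. (I even had to hard-code the maximum column count as 11 in the macro, because the first row has fewer columns.)

I would like the output to look exactly as it does on the website.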

Option Explicit

Public Sub GetTable()

Dim oDom As Object: Set oDom = CreateObject("htmlFile")
Dim x As Long, y As Long
Dim oRow As Object, oCell As Object
Dim vData As Variant
Dim link As String

link = "http://medicarestatistics.humanservices.gov.au/statistics/do.jsp?_PROGRAM=%2Fstatistics%2Fmbs_group_standard_report&DRILL=on&GROUP=Broad+Type+of+Service+%28BTOS%29&VAR=services&STAT=count&RPT_FMT=by+time+period+and+state&PTYPE=month&START_DT=201609&END_DT=201609"

y = 1: x = 1

With CreateObject("msxml2.xmlhttp")
    .Open "GET", link, False
    .Send
    oDom.body.innerHtml = .responseText
End With

With oDom.getelementsbytagname("table")(0)
    ReDim vData(1 To .Rows.Length, 1 To 11) '.Rows(1).Cells.Length)
    For Each oRow In .Rows
        For Each oCell In oRow.Cells
            vData(x, y) = oCell.innerText
            y = y + 1
        Next oCell
        y = 1
        x = x + 1
    Next oRow
End With

Sheets(1).Cells(1, 1).Resize(UBound(vData), UBound(vData, 2)).Value = vData
End Sub

【Comments】:

  • You need to check the colSpan property of each TD/TH element and create a merged cell for any colSpan > 1.

Tags: vba xmlhttprequest


【Solution 1】:

On each pass through the loop, check the length of the current row and widen the array if more columns are needed:

With oDom.getelementsbytagname("table")(0)
    Dim rowCount As Long
    rowCount = .Rows.Length
    ' Start with the width of the first row; widen later if a row needs more.
    ReDim vData(1 To rowCount, 1 To .Rows(0).Cells.Length)
    For Each oRow In .Rows
        Dim columnCount As Long
        columnCount = oRow.Cells.Length
        ' ReDim Preserve can only resize the last dimension (the columns here).
        If columnCount > UBound(vData, 2) Then
            ReDim Preserve vData(1 To rowCount, 1 To columnCount)
        End If
        For Each oCell In oRow.Cells
            vData(x, y) = oCell.innerText
            y = y + 1
        Next oCell
        y = 1
        x = x + 1
    Next oRow
End With

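Edit:

I had not checked for column spans in the source table. One option is to follow @Thunderframe's suggestion and test the colSpan of every cell, but that seems a bit tedious. Personally, I would take advantage of the fact that Excel knows how to paste HTML from the clipboard and let Excel work out the layout itself: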

With oDom.getelementsbytagname("table")(0)
    Dim dataObj As Object
    Set dataObj = CreateObject("new:{1C3B4210-F441-11CE-B9EA-00AA006B1A69}")
    dataObj.SetText "<table>" & .innerHtml & "</table>"
    dataObj.PutInClipboard
End With

Sheets(1).Paste Sheets(1).Cells(1, 1)
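
For comparison, the colSpan check suggested in the comments could be sketched roughly as below. This is only a minimal sketch, not part of the original answer: the function name TableToGrid is hypothetical, and it assumes each cell exposes the colSpan and rowSpan properties of an MSHTML table cell. Instead of merging worksheet cells, it leaves the slots covered by a span blank, so that later cells land in the correct column.

```vba
Option Explicit

' Hypothetical helper (not from the original answer): copies an HTML table
' into a 2-D Variant array while honouring colSpan and rowSpan.
Public Function TableToGrid(ByVal oTable As Object) As Variant
    Dim grid() As Variant
    Dim oRow As Object, oCell As Object
    Dim rowCount As Long, r As Long, c As Long
    Dim cs As Long, rs As Long
    Dim lastRow As Long, lastCol As Long
    Dim i As Long, j As Long

    rowCount = oTable.Rows.Length
    If rowCount = 0 Then Exit Function

    ' Start one column wide; the array grows as wider rows are found.
    ReDim grid(1 To rowCount, 1 To 1)

    For Each oRow In oTable.Rows
        r = r + 1
        c = 1
        For Each oCell In oRow.Cells
            ' Skip slots already claimed by a rowSpan from an earlier row.
            Do While c <= UBound(grid, 2)
                If IsEmpty(grid(r, c)) Then Exit Do
                c = c + 1
            Loop

            cs = oCell.colSpan: If cs < 1 Then cs = 1
            rs = oCell.rowSpan: If rs < 1 Then rs = 1

            lastCol = c + cs - 1
            If lastCol > UBound(grid, 2) Then
                ' Only the last dimension can be resized with Preserve.
                ReDim Preserve grid(1 To rowCount, 1 To lastCol)
            End If
            lastRow = r + rs - 1
            If lastRow > rowCount Then lastRow = rowCount

            ' Mark every slot covered by the span as occupied (blank text).
            For i = r To lastRow
                For j = c To lastCol
                    grid(i, j) = vbNullString
                Next j
            Next i

            ' The text itself goes into the top-left slot of the span.
            grid(r, c) = oCell.innerText
            c = lastCol + 1
        Next oCell
    Next oRow

    TableToGrid = grid
End Function
```

With the variables from the question, it could be called as vData = TableToGrid(oDom.getelementsbytagname("table")(0)) and written to the sheet with the same Resize line as before. Actually merging the worksheet cells for each span would be a further step that this sketch does not attempt.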

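【Comments】:

  • It handles the array size correctly, but unfortunately it does not fix the incorrect output caused by merged cells.
  • @RyszardJędraszyk - I missed the column spans. See the edit.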