【Question title】:Reading specific text and writing it to multiple lines by breaking at specific character [duplicate]
【Posted】:2021-09-05 12:51:05
【Question】:

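I have text like the following in a text file, all in one line.

I want to read the text between the Payload tags and print it, breaking at "~" so that it is printed on multiple lines, as shown below. How can I do this in Python?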

Line 1- This is the data you requested~
Line 2- Use this data for processing~
Line 3 - If processing fails contact us @ 188-500-5678~


<?xml version="1.0" encoding="UTF-8"?>
<env:Envelope xmlns:env="http://www.w3.org/2003/05/soap-envelope"><env:Header/><env:Body><re:EnvelopeResponse xmlns:core="http://www.AAAA.org/SOAP/SDL/Rule21.1.0.xsd"><PayloadType>Response</PayloadType><ProcessingMode>R</ProcessingMode><PayloadID>42eaf653</PayloadID><TimeStamp>2021-06-01</TimeStamp><SenderID>X</SenderID><ReceiverID>X</ReceiverID><RuleVersion>1.1.0</RuleVersion><Payload>This is the data you requested~Use this data for processing~If processing fails contact us @ 188-500-5678~</Payload><ErrorCode>Success</ErrorCode><ErrorMessage/></R:EnvelopeResponse></env:Body></env:Envelope>

【Comments】:

  • "all in one line", but what you show is not all in one line.
  • @mkrieger1 I think he means the text file is all on one line, and he wants to break it up.

Tags: python


【Solution 1】:

This is how you can extract the payload:

from bs4 import BeautifulSoup

data = """<?xml version="1.0" encoding="UTF-8"?><env:Envelope xmlns:env="http://www.w3.org/2003/05/soap-envelope"><env:Header/><env:Body><re:EnvelopeResponse xmlns:core="http://www.AAAA.org/SOAP/SDL/Rule21.1.0.xsd"><PayloadType>Response</PayloadType><ProcessingMode>R</ProcessingMode><PayloadID>42eaf653</PayloadID><TimeStamp>2021-06-01</TimeStamp><SenderID>X</SenderID><ReceiverID>X</ReceiverID><RuleVersion>1.1.0</RuleVersion><Payload>This is the data you requested~Use this data for processing~If processing fails contact us @ 188-500-5678~</Payload><ErrorCode>Success</ErrorCode><ErrorMessage/></R:EnvelopeResponse></env:Body></env:Envelope>"""
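# The "xml" parser requires the lxml package to be installed.
soup = BeautifulSoup(data, "xml")
# find_all returns a list of matching tags; here it holds the single Payload element.
items = soup.find_all("Payload")
print(items)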

To put each segment on its own line, replace every "~" with a newline (\n) using replace(). You can also use replace() to remove the tags from the string.
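
As a minimal sketch of the splitting step, here is a standard-library version. The sample response in the question opens re:EnvelopeResponse but closes it as R:EnvelopeResponse, so a strict XML parser may reject it; for that reason this sketch swaps in a regular expression instead of BeautifulSoup. The function name payload_lines and the shortened data string are illustrative assumptions, not part of the original answer.

```python
import re

# Shortened, hypothetical copy of the one-line response from the question;
# only the Payload element matters for this sketch.
data = (
    "<env:Body><PayloadType>Response</PayloadType>"
    "<Payload>This is the data you requested~Use this data for processing~"
    "If processing fails contact us @ 188-500-5678~</Payload>"
    "<ErrorCode>Success</ErrorCode></env:Body>"
)


def payload_lines(text):
    """Return the '~'-separated segments inside the first <Payload> element."""
    match = re.search(r"<Payload>(.*?)</Payload>", text, re.DOTALL)
    if match is None:
        return []
    # The payload ends with '~', so split() yields an empty last piece;
    # drop empty segments.
    return [part for part in match.group(1).split("~") if part]


lines = payload_lines(data)
for number, line in enumerate(lines, start=1):
    print(f"Line {number}- {line}")
```

The segments are returned without the "~" separator; if the separator should stay at the end of each printed line, as in the question's expected output, append it in the print call. To process the real file, read its contents into a string and pass that to payload_lines.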

【Comments】:
