【问题标题】:Extract the last part of a facebook page url提取 facebook 页面 url 的最后一部分
【发布时间】:2016-06-30 15:14:51
【问题描述】:
【问题讨论】:
标签:
python
regex
url
entity
【解决方案1】:
try:
from urlparse import urlparse
except ImportError:
from urllib.parse import urlparse
links = [
'http://www.facebook.com/pages/Blue-Mountain-Aromatics/561694213861926',
'http://www.facebook.com/1905BocaJuniors',
'https://www.facebook.com/7upGuatemala?ref=br_tf',
'http://www.fb.com/supligenjm',
'http://www.facebook.com/axebolivia?sk=wall&filter=1',
]
for url in links:
url = urlparse(url)
path = url.path.split('/')
entity = path[2] if path[1] == 'pages' else path[1]
print(entity)
【解决方案2】:
@Robᵩs 答案的 Python 3 版本(并重写为函数):
from urllib.parse import urlparse
links = [
'http://www.facebook.com/pages/Blue-Mountain-Aromatics/561694213861926',
'http://www.facebook.com/1905BocaJuniors',
'https://www.facebook.com/7upGuatemala?ref=br_tf',
'http://www.fb.com/supligenjm',
'http://www.facebook.com/axebolivia?sk=wall&filter=1',
]
def fb_extract(url):
url = urlparse(url)
path = url.path.split('/')
entity = path[2] if path[1] == 'pages' else path[1]
return entity
for url in links:
fb_extract(url)
希望这会有所帮助!