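【Posted】: 2013-02-07 17:28:25
【Question】:
I've searched a lot for this but couldn't find a solution. Here is a similar question with a possible solution in Java.
Is there a similar solution in Python?
【Question comments】: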
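Besides Selenium, this example needs the PIL imaging library. Sometimes it is already installed and sometimes it is not; if you don't have it, you can install it with pip install Pillow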
from selenium import webdriver
from PIL import Image
from io import BytesIO
fox = webdriver.Firefox()
fox.get('http://stackoverflow.com/')
# now that we have the preliminary stuff out of the way time to get that image :D
element = fox.find_element_by_id('hlogo') # find part of the page you want image of
location = element.location
size = element.size
png = fox.get_screenshot_as_png() # returns the page screenshot as PNG bytes (nothing is written to disk)
fox.quit()
im = Image.open(BytesIO(png)) # uses PIL library to open image in memory
left = location['x']
top = location['y']
right = location['x'] + size['width']
bottom = location['y'] + size['height']
im = im.crop((left, top, right, bottom)) # defines crop points
im.save('screenshot.png') # saves new cropped image
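The final output is... the Stack Overflow logo!!!
Of course this is overkill for grabbing a static image, but if you want to capture something that needs JavaScript to render, this can be a viable solution.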
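One thing to watch for with the crop approach: on HiDPI displays the screenshot can contain more pixels than the CSS coordinates reported by location and size, so the crop box may need scaling. Below is a minimal sketch of the box arithmetic only. The helper name crop_box and the scale parameter are my own assumptions (you would obtain the factor yourself, for example from window.devicePixelRatio); neither is part of Selenium or PIL.

```python
import math

def crop_box(location, size, scale=1.0):
    """Return (left, top, right, bottom) in screenshot pixels.

    location and size are dicts shaped like Selenium's element.location
    and element.size; scale is an assumed device-pixel-ratio factor.
    """
    left = math.floor(location['x'] * scale)
    top = math.floor(location['y'] * scale)
    # Round the far edges up so fractional pixels are not cut off.
    right = math.ceil((location['x'] + size['width']) * scale)
    bottom = math.ceil((location['y'] + size['height']) * scale)
    return (left, top, right, bottom)

box = crop_box({'x': 10.5, 'y': 20}, {'width': 100, 'height': 40.2}, scale=2.0)
print(box)
```

The resulting tuple is what you would pass to im.crop(...) in the answer above.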
【Comments】:
img = Image.open(StringIO(base64.decodestring(driver.get_screenshot_as_base64())))
img = fox.get_screenshot_as_png(), then img = Image.open(StringIO(img)) to load it as a PIL image.
Use BytesIO instead of StringIO.
Worked for me in Python 3.5.
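On Python 3 these comments boil down to two substitutions: base64.b64decode in place of base64.decodestring (which was removed in Python 3.9) and io.BytesIO in place of StringIO. Here is a small sketch without a browser; fake_b64 is a stand-in I made up for what get_screenshot_as_base64() would return, not a real screenshot.

```python
import base64
from io import BytesIO

PNG_SIGNATURE = b'\x89PNG\r\n\x1a\n'

# Stand-in for driver.get_screenshot_as_base64(); not a real image.
fake_b64 = base64.b64encode(PNG_SIGNATURE + b'payload').decode('ascii')

raw = base64.b64decode(fake_b64)  # decoded bytes
buf = BytesIO(raw)                # in-memory binary file object

# With a real screenshot, buf is what you would hand to PIL's Image.open.
print(buf.read(8) == PNG_SIGNATURE)
```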
from selenium import webdriver
fox = webdriver.Firefox()
fox.get('http://stackoverflow.com/')
image = fox.find_element_by_id('hlogo').screenshot_as_png
P.S.
To save to a file:
fox.find_element_by_id('hlogo').screenshot(output_file_path)  # output_file_path: where to write the .png
【Comments】:
image = driver.find_element_by_id('el_id').screenshot_as_png is a property of the element; how can I save it as an image?
image = driver.find_element_by_id('el_id').screenshot(output_file_path). See the API documentation here.
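On the question of saving the screenshot_as_png value: it is plain PNG bytes, so it can be written to a file opened in binary mode. This is a sketch using a hypothetical helper save_png and placeholder bytes instead of a real element screenshot.

```python
import os
import tempfile

def save_png(png_bytes, path):
    """Write raw PNG bytes (e.g. element.screenshot_as_png) to path."""
    with open(path, 'wb') as fh:  # binary mode, because the data is bytes
        fh.write(png_bytes)
    return path

# Placeholder data; with Selenium you would pass element.screenshot_as_png.
data = b'\x89PNG\r\n\x1a\n' + b'not a real image'
out = save_png(data, os.path.join(tempfile.mkdtemp(), 'element.png'))
print(os.path.getsize(out) == len(data))
```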
I wrote this handy Python 3 function.
from base64 import b64decode
from wand.image import Image
from selenium.webdriver.remote.webelement import WebElement
from selenium.webdriver.common.action_chains import ActionChains
import math
def get_element_screenshot(element: WebElement) -> bytes:
    driver = element._parent
    ActionChains(driver).move_to_element(element).perform()  # focus
    src_base64 = driver.get_screenshot_as_base64()
    scr_png = b64decode(src_base64)
    scr_img = Image(blob=scr_png)
    x = element.location["x"]
    y = element.location["y"]
    w = element.size["width"]
    h = element.size["height"]
    # crop the full screenshot in place to the element's bounding box
    scr_img.crop(
        left=math.floor(x),
        top=math.floor(y),
        width=math.ceil(w),
        height=math.ceil(h),
    )
    return scr_img.make_blob()
It returns a PNG image of the displayed element as bytes.
Limitation: the element must fit in the viewport.
You must install the wand module to use it.
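The viewport limitation can be checked before cropping. This is a rough sketch with a hypothetical helper fits_in_viewport. It assumes you have read the viewport size yourself (for instance from window.innerWidth and window.innerHeight) and that the rectangle holds viewport-relative coordinates under the keys x, y, width and height.

```python
def fits_in_viewport(rect, viewport_width, viewport_height):
    """True if the element rectangle lies fully inside the viewport.

    rect is assumed to use viewport-relative coordinates with the keys
    x, y, width and height.
    """
    return (
        rect['x'] >= 0
        and rect['y'] >= 0
        and rect['x'] + rect['width'] <= viewport_width
        and rect['y'] + rect['height'] <= viewport_height
    )

inside = fits_in_viewport({'x': 10, 'y': 10, 'width': 100, 'height': 50}, 1280, 720)
cut_off = fits_in_viewport({'x': 10, 'y': 700, 'width': 100, 'height': 50}, 1280, 720)
print(inside, cut_off)
```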
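【Comments】:
Use x = element.location_once_scrolled_into_view["x"] and y = element.location_once_scrolled_into_view["y"], because location may return a y larger than the window.
Here is a function; the sizes must be converted to integers before they are passed to the crop function: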
from PIL import Image
from StringIO import StringIO  # Python 2 module; on Python 3 use io.BytesIO instead
def capture_element(element, driver):
    location = element.location
    size = element.size
    img = driver.get_screenshot_as_png()
    img = Image.open(StringIO(img))
    left = location['x']
    top = location['y']
    right = location['x'] + size['width']
    bottom = location['y'] + size['height']
    # crop needs integer coordinates
    img = img.crop((int(left), int(top), int(right), int(bottom)))
    img.save('screenshot.png')
【Comments】:
TypeError: initial_value must be str or None, not bytes
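That error is what Python 3's io.StringIO raises when it is handed bytes, which is what get_screenshot_as_png() returns. Here is a small illustration with placeholder bytes and no browser involved.

```python
import io

png_bytes = b'\x89PNG\r\n\x1a\n'  # placeholder for get_screenshot_as_png()

try:
    io.StringIO(png_bytes)  # text buffer: rejects bytes on Python 3
    error = None
except TypeError as exc:
    error = str(exc)

buf = io.BytesIO(png_bytes)  # binary buffer: accepts bytes
print(error)
print(buf.getvalue() == png_bytes)
```

Swapping StringIO for BytesIO in the function above is therefore the whole fix on Python 3.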
In response to RandomPhobia's very good answer, and expanding on the comments, here are two solutions with the correct import statements that open a full-screen screenshot without first saving it to a file:
from selenium import webdriver
from PIL import Image
from io import BytesIO  # takes the place of Python 2's StringIO; works on 2.7 and 3
import base64
DRIVER = 'chromedriver'
browser = webdriver.Chrome(DRIVER)
browser.get("http://www.bbc.co.uk")
img1 = Image.open(BytesIO(base64.b64decode(browser.get_screenshot_as_base64())))
img2 = Image.open(BytesIO(browser.get_screenshot_as_png()))
Since I'm sure your next question is, "That's great, but which one is fastest?", here is how to find out (I found the first method to be the fastest by some distance):
import timeit
setup = '''
from selenium import webdriver
from PIL import Image
from io import BytesIO
import base64
DRIVER = 'chromedriver'
browser = webdriver.Chrome(DRIVER)
browser.get("http://www.bbc.co.uk")
file_name = 'tmp.png'
'''
print(timeit.Timer('Image.open(BytesIO(browser.get_screenshot_as_png()))', setup=setup).repeat(2, 10))
print(timeit.Timer('Image.open(BytesIO(base64.b64decode(browser.get_screenshot_as_base64())))', setup=setup).repeat(2, 10))
print(timeit.Timer('browser.get_screenshot_as_file(file_name); pil_img = Image.open(file_name)', setup=setup).repeat(2, 10))
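repeat returns a list of total times, one per run, and a common way to compare candidates is by the minimum of each list. Below is a browser-free sketch of the same measurement pattern. The two lambdas are stand-ins I picked for the screenshot calls, so the numbers say nothing about Selenium itself.

```python
import base64
import timeit
from io import BytesIO

payload = b'\x89PNG\r\n\x1a\n' + b'\x00' * 100000  # stand-in for screenshot bytes
payload_b64 = base64.b64encode(payload)

candidates = {
    'raw bytes': lambda: BytesIO(payload),
    'base64': lambda: BytesIO(base64.b64decode(payload_b64)),
}

# repeat(repeat=2, number=10): two runs of ten calls each, as in the answer.
results = {
    name: min(timeit.Timer(func).repeat(repeat=2, number=10))
    for name, func in candidates.items()
}
for name, seconds in sorted(results.items(), key=lambda kv: kv[1]):
    print('%-10s %.6f s per 10 calls' % (name, seconds))
```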
【Comments】:
Element screenshot:
from PIL import Image
from io import BytesIO
image = self.browser.driver.find_element_by_class_name('example.bla.bla').screenshot_as_png
im = Image.open(BytesIO(image)) # uses PIL library to open image in memory
im.save('example.png')
【Comments】:
WebDriverException: Message: unknown error: failed to parse value of getElementRegion (Session info: chrome=78.0.3904.108)
It's as simple as this:
element = driver.find_element_by_class_name('myclass')
element.screenshot('screenshot.png')
【Comments】:
I turned @randomphobia's answer into a function. I also used @bummis's suggestion to use location_once_scrolled_into_view instead of location, so that it works regardless of page size.
from selenium import webdriver
from PIL import Image
from io import BytesIO
def take_screenshot(element, driver, filename='screenshot.png'):
    location = element.location_once_scrolled_into_view
    size = element.size
    png = driver.get_screenshot_as_png()  # page screenshot as PNG bytes
    im = Image.open(BytesIO(png))  # uses PIL library to open image in memory
    left = location['x']
    top = location['y']
    right = location['x'] + size['width']
    bottom = location['y'] + size['height']
    im = im.crop((left, top, right, bottom))  # defines crop points
    im.save(filename)  # saves new cropped image
Here is a gist: https://gist.github.com/WittmannF/b714d3ceb7b6a5cd50002f11fb5a4929
【Comments】: