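You can try the Google Scholar Organic Results API from SerpApi. It is a paid API with a free plan.
It bypasses blocks from search engines by using dedicated proxies and CAPTCHA-solving services, and it handles scaling, so you don't need to create a parser from scratch and maintain it over time.
Code and example to integrate with PHP in the online IDE: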
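For orientation, the client libraries shown in this answer wrap an ordinary HTTP GET request. Here is a minimal sketch of how the search parameters map onto a request URL. It assumes the `https://serpapi.com/search.json` endpoint; `build_search_url` and the `YOUR_API_KEY` placeholder are made up for illustration, and nothing is sent over the network.

```python
from urllib.parse import urlencode

# Assumption: SerpApi's JSON search endpoint.
SEARCH_ENDPOINT = "https://serpapi.com/search.json"


def build_search_url(query, api_key, language="en"):
    """Build a Google Scholar search URL (hypothetical helper, no request is made)."""
    params = {
        "engine": "google_scholar",  # which SerpApi engine to use
        "q": query,                  # the search query
        "hl": language,              # interface language
        "api_key": api_key,          # your private key
    }
    # urlencode escapes the values, e.g. spaces become "+".
    return f"{SEARCH_ENDPOINT}?{urlencode(params)}"


print(build_search_url("data science", "YOUR_API_KEY"))
# https://serpapi.com/search.json?engine=google_scholar&q=data+science&hl=en&api_key=YOUR_API_KEY
```

In practice the official clients are more convenient, since they build the request and decode the JSON for you.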
<?php
ini_set('display_errors', 1);
ini_set('display_startup_errors', 1);
error_reporting(E_ALL);
require __DIR__ . '/vendor/autoload.php';
$queries = array(
    "moon",
    "pandas",
    "python",
    "data science",
    "ML",
    "AI",
    "animals",
    "amd",
    "nvidia",
    "intel",
    "asus",
    "robbery pi",
    "latex, tex",
    "amg",
    "blizzard",
    "world of warcraft",
    "cs go",
    "antarctica",
    "fifa",
    "amsterdam",
    "usa",
    "tesla",
    "economy",
    "ecology",
    "biology"
);
// Create the client once; the API key is read from the API_KEY environment variable.
$client = new GoogleSearch(getenv("API_KEY"));

foreach ($queries as $query) {
    $params = [
        "engine" => "google_scholar",
        "q" => $query,
        "hl" => "en"
    ];
    $response = $client->get_json($params);

    print_r("Extracting search query: {$query}\n");
    // Print the title of every organic result for this query.
    foreach ($response->organic_results as $result) {
        print_r("{$result->title}\n");
    }
}
?>
Code and example to integrate with Python:
from serpapi import GoogleScholarSearch
import os
queries = [
    "moon",
    "pandas",
    "python",
    "data science",
    "ML",
    "AI",
    "animals",
    "amd",
    "nvidia",
    "intel",
    "asus",
    "robbery pi",
    "latex, tex",
    "amg",
    "blizzard",
    "world of warcraft",
    "cs go",
    "antarctica",
    "fifa",
    "amsterdam",
    "usa",
    "tesla",
    "economy",
    "ecology",
    "biology",
]
for query in queries:
    params = {
        "api_key": os.getenv("API_KEY"),  # key is read from the API_KEY environment variable
        "engine": "google_scholar",
        "q": query,
        "hl": "en"
    }
    search = GoogleScholarSearch(params)
    results = search.get_dict()

    print(f"Extracting search query: {query}")
    # Print the title of every organic result for this query.
    for result in results["organic_results"]:
        print(result["title"])
Output:
Extracting search query: moon
Cellulose nanomaterials review: structure, properties and nanocomposites
Reflection in learning and professional development: Theory and practice
...
Extracting search query: biology
A new biology for a new century
The biology of mycorrhiza.
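The Python loop above reads `results["organic_results"]` directly, which raises a `KeyError` if a query comes back without that key. A minimal sketch of a more defensive variant follows. `extract_titles` is a hypothetical helper, the `"error"` key is an assumption about how a failed search is reported, and the sample dictionaries are hand-written stand-ins for `search.get_dict()` that reuse titles from the output above.

```python
def extract_titles(results):
    """Return the titles of all organic results, or an empty list if there are none."""
    # Assumption: a failed search carries an "error" key instead of results.
    if "error" in results:
        return []
    # .get() avoids a KeyError when "organic_results" is missing.
    return [
        item["title"]
        for item in results.get("organic_results", [])
        if "title" in item
    ]


# Hand-written stand-ins for search.get_dict(); not real API responses.
sample_ok = {
    "organic_results": [
        {"title": "A new biology for a new century"},
        {"title": "The biology of mycorrhiza."},
    ]
}
sample_failed = {"error": "placeholder error message"}

print(extract_titles(sample_ok))
print(extract_titles(sample_failed))
```

Inside the loop you would then iterate over `extract_titles(results)` instead of indexing the dictionary directly.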
Disclaimer: I work for SerpApi.