All Questions
Tagged with web-scraping php
1,697
questions
0
votes
0
answers
40
views
Is there a way to pull remotely generated XML, process it, and display it on a Managed WordPress page?
I need to get some XSQL generated XML data from a remote server, process it, and display it on a GoDaddy Managed WordPress page. (This is the website my workplace has to use for various reasons.) ...
-1
votes
1
answer
53
views
Split text by <br> in PHP scraping result
I am writing a PHP scraping program. The program works smoothly for me but I found the scraping result slightly differs from my expectation.
Here is my script
$ch = curl_init();
curl_setopt($...
0
votes
0
answers
47
views
php scrape data and no response due to long waiting
I am writing a php to scrape data from internet. It works smoothly and now I want to go one step furhter. After scraping the name and the url of the source, I want to go to each link and scrape the ...
0
votes
0
answers
36
views
simple HTML dom - returns an object, instead of returning null if it doesn't find the element
im using something like:
$span = $value -> find('span', 0);
var_dump($span);
and i get:
object(voku\helper\SimpleHtmlDomBlank)#63 (2) { ["node":protected]=> NULL ["classListCache&...
0
votes
0
answers
41
views
Recognizing small images on screenshot by specific keyword
I need consultation, idea from experienced developers.
I am using tesseract OCR to recognize text on some screenshots from a web page but I can extract just text. My problem is that, if I find that ...
1
vote
0
answers
44
views
scrapping with Goutte\Client in many page
I´m building a scrapping script to verifry any data in this web. My script should visit this web and get all hrefs. In this step i haven´t got any problem.
In my next step i verify that my data in db ...
0
votes
0
answers
37
views
all data duplicate in my associative array
i´m building a script that get data from my db, and in this data i have a urls. In this URLs i need search in article if contain any words or phrase and if contain it, get link.
to create this logic, ...
1
vote
1
answer
54
views
Symfony HttpClient and Simple HTML DOM?
I use Symfony\Component\HttpClient\HttpClient to retrieve the page content because I use a proxy.
// Customizing the client
$this->httpClient = HttpClient::create(
[
'proxy' ...
1
vote
1
answer
90
views
Symfony/panther web Scraping not working for content behind login (Cloud function)
use \Symfony\Component\Panther\Client;
require __DIR__ . '/vendor/autoload.php';
$options = [
'--user-agent=Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/...
0
votes
1
answer
48
views
CdpPage.querySelectorAll is not a function while using puphpeteer
I was working with puphpeteer trying to get some info from an online shopping website. the library is working well and i even took some screenshots from the website and it looks ok. but when i try ...
0
votes
1
answer
172
views
Chromium or chrome needed in vercel deployed laravel project
Recently i tried to implement in one of my projects the ChromePHP library to use the headless browser provided by it in my app. tried using pupheteer and browsershot but ChromePHP worked the best for ...
0
votes
1
answer
302
views
php curl - emulate a request like a browser
i am trying to retrieve the content of the following page with PHP Curl:
https://www.whitepages.com/name/Antonio-Dalesio
The problem is that the page recognize that the request does not come from a ...
0
votes
1
answer
171
views
Node not found on laravel project
i've been trying to use a package called PuPHPeteer but i have a problem:
It seems like the package requires node to be used. https://github.com/zoonru/puphpeteer this is the package link. I installed ...
-3
votes
1
answer
73
views
How to get the current decimal/thousand separator for an unknown number
I have to insert into the database a number acquired from various sites for which I don't know what the decimal separator/thousand separator is.
Input numbers that express the value 1000 could be:
1,...
0
votes
0
answers
42
views
cURL in PHP - Handling "Not Secure" Warning on HTTP Site
I am using a PHP script in my wordpress based website that uses cURL to scrape content from a website. The site displays a "Not Secure" warning when accessed manually, but clicking "...