Practicando un poco de Web Scraping con 3Speak ['SPA','ENG']

Hoy me tome la molestia de seguir practicando comandos básicos con “pupperteer”, una tecnología que me solicitaron para un trabajo a la hora de obtener información de paginas web, ya sabia de la existencia de este tipo de técnicas pero nunca me anime a utilizarlas por pensé que era algo complicado, pero esta misma herramienta que utilizo me facilita mucho el trabajo.

Today I took the trouble to continue practicing basic commands with “pupperteer”, a technology that was requested of me for a job when it came to obtaining information from web pages, I already knew of the existence of this type of techniques but I never dared to use them for I thought it was somewhat complicated, but this same tool that I use makes my work a lot easier.

scrap.jpg

scrap1.jpg
A la hora de aplicar esta tecnología en “3Speak” me di cuenta que quien diseño el lado del cliente no aplica buenas practicas con respecto al uso correcto de las etiquetas y estilos, asi que me complico un poco la obtención de los datos por medio de selectores, pero nada imposible de hacer.
En este caso utilicé un selector general para capturar todas las etiquetas que contengan dentro el título de cada video, de los cuales los itere con método de las matrices y solicite el contenido textual de cada elemento creando otra matriz que almacene cada cadena de texto con el nombre de cada video.

When applying this technology in “3Speak” I realized that whoever designed the client side does not apply good practices regarding the correct use of labels and styles, so it became a little complicated for me to obtain the data through selectors, but nothing impossible to do. In this case I used a general selector to capture all the tags that contain the title of each video, of which I iterated them with the array method and requested the textual content of each element by creating another array that stores each text string with the name of each video.

scrap2.jpg

scrap3.jpg

scrap4.jpg

I love this tool



0
0
0.000
1 comments
avatar

Thanks for your contribution to the STEMsocial community. Feel free to join us on discord to get to know the rest of us!

Please consider delegating to the @stemsocial account (85% of the curation rewards are returned).

You may also include @stemsocial as a beneficiary of the rewards of this post to get a stronger support. 
 

0
0
0.000