RMaster - Dr. Kim's Homepage

글

6월, 2015의 게시물 표시

Web Harvesting + Dynamic HTML 다루기

- 6월 29, 2015

# TripAdvisor_review_image ------------------------------------------------ library(rvest) library(RSelenium) library(foreach) #웹주소 # 함수 ---------------------------------------------------------------------- expandMore <- function(wdd) { eles = wdd$findElements(using='css selector','span.moreLink') a=foreach(ele=eles) %do% tryCatch(ele$clickElement(),error=function(e){}) } goNextReviewPage <- function(wdd) { result <- tryCatch({ nextBtn = wdd$findElement(using='css selector','a.nav.next') nextBtn$clickElement() return(TRUE) },error=function(e){ return(FALSE) }) result } # Headless PhantomJS ------------------------------------------------------ pjs <- phantom() # Web Harvesting ---------------------------------------------------------- url = "http://www.tripadvisor.com/Restaurant_Review-g35805-d570063-Reviews-or20-Alinea-Chicago_Illinois.html#REVIEWS...

자세한 내용 보기

인터넷에서 그림 다운로드 받기

- 6월 29, 2015

이달 초에 아주대에서 특강을 했습니다. 그때 수업을 듣던 학생이 질문을 합니다. "어떻게 하면 인터넷에 있는 이미지를 다운로드 받지요?" 간단한 예를 들어보겠습니다. http://images.all-free-download.com/images/graphiclarge/disappearing_city_554541.jpg 이 주소의 이미지는... R에서 다운로드 받겠습니다. R> photo.url = "http://images.all-free-download.com/images/graphiclarge/disappearing_city_554541.jpg" R> download.file(photo.url,"free_sample.jpg",quiet=TRUE,mode='wb') free_sample.jpg라는 이름으로 다운로드 받았습니다.

자세한 내용 보기