theBioBucket*: 2012

22 Dec 2012

Convert OpenStreetMap Objects to KML with R

A quick geo-tip:
With the osmar and maptools package you can easily pull an OpenStreetMap object and convert it to KML, like below (thanks to adibender helping out on SO). I found the relation ID by googling for it (www.google.at/search?q=openstreetmap+relation+innsbruck).

# get OSM data
library(osmar)
library(maptools)
 
innsbruck <- get_osm(relation(113642), full = T)
sp_innsbruck <- as_sp(innsbruck, what = "lines")
 
# convert to KML
for( i in seq_along(sp_innsbruck) ) {
      kmlLine(sp_innsbruck@lines[[i]], kmlfile = "innsbruck.kml",
               lwd = 3, col = "blue", name = "Innsbruck") 
      }
 
shell.exec("innsbruck.kml")

16 Dec 2012

Taxonomy with R: Exploring the Taxize-Package

http://upload.wikimedia.org/wikipedia/commons/6/68/Ernst_Haeckel_-_Tree_of_Life.jpg

First off, I'd really like to give a shout-out to the brave people who have created and maintain this great package - the fame is yours!

So, while exploring the capabilities of the package some issues with the ITIS-Server arose and with large datasets things weren't working out quite well for me.
I then switched to the NCBI API and saw that the result is much better here (way quicker, on first glance also a higher coverage).
At the time there is no taxize-function that will pull taxonomic details from a classification returned by NCBI, that's why I plugged together a little wrapper - see here:

# some species data:
spec <- data.frame("Species" = I(c("Bryum schleicheri", "Bryum capillare", "Bryum argentum", "Escherichia coli", "Glis glis")))
spl <- strsplit(spec$Species, " ")
spec$Genus <- as.character(sapply(spl, "[[", 1))

# for pulling taxonomic details we'd best submit higher rank taxons
# in this case Genera. Then we'll submit Genus Bryum only once and 
# save some computation time (might be an issue if you deal 
# with large datasets..)

gen_uniq <- unique(spec$Genus)

# function for pulling classification details ("phylum" in this case)
get_sys_level <- function(x){ require(taxize)
                              a <- classification(get_uid(x))
                              y <- data.frame(a[[1]])                                        # if there are multiple results, take the first..
                              z <- tryCatch(as.character(y[which(y[,2] == "phylum"), 1]),    # in case of any other errors put NA
                                            error = function(e) NA)
                              z <- ifelse(length(z) != 0, z, NA)                             # if the taxonomic detail is not covered return NA
                              return(data.frame(Taxon = x, Syslevel = z))
                             }

# call function and rbind the returned values 
result <- do.call(rbind, lapply(gen_uniq, get_sys_level))
print(result)
#         Taxon       Syslevel
# 1       Bryum   Streptophyta
# 2 Escherichia Proteobacteria
# 3        Glis       Chordata

# now merge back to the original data frame
spec_new <- merge(spec, result, by.x = "Genus", by.y = "Taxon")
print(spec_new)
#         Genus           Species       Syslevel
# 1       Bryum Bryum schleicheri   Streptophyta
# 2       Bryum   Bryum capillare   Streptophyta
# 3       Bryum    Bryum argentum   Streptophyta
# 4 Escherichia  Escherichia coli Proteobacteria
# 5        Glis         Glis glis       Chordata
#

28 Nov 2012

So, What Are You? ..A Plant? ..An Animal? -- Nope, I'm a Fungus!

Lately I had a list of about 1000 species names and I wanted to filter out only the plants as that is where I come from. I knew that Scott Chamberlain has put together the ritis package which obviously can do such things. However, I knew of ITIS before and was keen to give it a shot..

Here's what I've come up with (using the ITIS API, updated on 11. Dec 2012, previous version had a flaw with indefinite matches.. Should be ok now. However, there are of course species that are not covered by the database, i.e. Ixodes, see below):

library(XML)
get_tsn <- function(sp_name) {
           require(XML)
           units <- tolower(unlist(strsplit(sp_name, " ")))

           # valid string?
           if (length(units) > 2) { stop("...No valid search string submitted (two words seperated by one space)!") }

           itis_xml <- htmlParse(paste("http://www.itis.gov/ITISWebService/services/ITISService/searchByScientificName?srchKey=", 
                                       sp_name, sep=""))
           tsn <- xpathSApply(itis_xml, "//tsn", xmlValue)
           unitname1 <- tolower(gsub("\\s+", "", xpathSApply(itis_xml, "//unitname1", xmlValue)))
           unitname2 <- tolower(gsub("\\s+", "", xpathSApply(itis_xml, "//unitname2", xmlValue)))
           unitname3 <- tolower(gsub("\\s+", "", xpathSApply(itis_xml, "//unitname3", xmlValue)))

           # sp_name = only Genus, get tsn were sp_name matches perfectly and unitname2 (lower level taxon) is absent 
           if (length(units) == 1) {
               return(tsn[tolower(sub("\\s+", "", unitname1)) == tolower(sp_name) & unitname2 == ""]) }

           # sp_name = Genus and Epitheton, get tsn where both match perfectly and unitname3 (lower level taxon) is absent 
           if (length(units) == 2) {
               return(tsn[unitname1 == units[1] & 
                          unitname2 == units[2] &
                          nchar(unitname3) == 0]) }
           }

get_kngdm <- function(tsn) {
                   kngdm <- xpathSApply(htmlParse(paste("http://www.itis.gov/ITISWebService/services/ITISService/getKingdomNameFromTSN?tsn=", 
                                                       tsn, sep="")), 
                                                  "//kingdomname", xmlValue)
           return(kngdm)
           }

get_tsn_kngdm <- function(x) {y = get_tsn(x)
                              z = get_kngdm(y)
                              return(list(Name = x, TSN = y, Kingdom = z))
                              }

# I had some API-related errors (I guess it was mysteriously not answering in 
# some cases). I couldn't resolve this and thus implemented tryCatch
get_tsn_kngdm_try <- function(x) tryCatch(get_tsn_kngdm(x), error = function(e) NULL)

sp_names <- c("Clostridium", "Physcia", "Ixodes", "LYNX", "Homo sapiens", "Canis lupus")

system.time(result <- data.frame(do.call(rbind, lapply(sp_names, FUN = get_tsn_kngdm_try))))
result

system.time(result <- data.frame(do.call(rbind, lapply(sp_names, FUN = get_tsn_kngdm_try))))
#
# result
#        User      System verstrichen 
#        1.54        0.01       33.66 
#           Name    TSN  Kingdom
# 1  Clostridium 555645   Monera
# 2      Physcia  14024    Fungi
# 3        Viola  22030  Plantae
# 4       Ixodes                
# 5         LYNX 180581 Animalia
# 6 Homo sapiens 180092 Animalia
# 7  Canis lupus 180596 Animalia
#

20 Nov 2012

Add Comments in MS-Word using VBA

This VBA procedure (1) Searches words ("Str1", "Str2",..) and adds commentboxes to it and (2) changes the initials used in the box:

Sub CommentBox()

    Dim range As range
    Dim i As Long
    Dim TargetList
    Dim cm As Comment
    
    TargetList = Array("Str1", "Str2")
    
    For i = 0 To UBound(TargetList)
    
    Set range = ActiveDocument.range
    
    With range.Find
    .Text = TargetList(i)
    .Format = True
    .MatchCase = False
    .MatchWholeWord = True
    .MatchWildcards = False
    .MatchSoundsLike = False
    .MatchAllWordForms = False
        
    Do While .Execute(Forward:=True) = True

        Set cm = range.Comments.Add(range:=range, Text:="Hallo")
        cm.Author = "InternalName"
        cm.Initial = "Initials"
        
        Loop
    
    End With

    Next

With ActiveDocument.Styles("Kommentartext").Font
        .Size = 12
        .Bold = True
        .Italic = True
        .Color = wdColorBlue
End With

End Sub

12 Nov 2012

WIKI Search in Excel with VBA

Here's a VBA code snippet for searching a string in a cell in WIKIPEDIA:

Sub wiki()

Dim searchstr As String
Dim searchsadd As String

searchstr = ActiveCell.Value
searchadd = "http://en.wikipedia.org/w/index.php?title=Special%3ASearch&profile=default&search=" & searchstr & "&fulltext=Search"
  
  If Len(searchstr) = 0 Then
    MsgBox "The active cell is empty.. Nothing to search for..", vbOKOnly, "WIKIPEDIA"

  Else:
    ActiveWorkbook.FollowHyperlink searchadd, NewWindow:=True
    Application.StatusBar = "WIKI search for: " & searchstr
  End If
  
End Sub

6 Nov 2012

Calculate Single Contour-Line from DEM with QGIS / GDAL

In QGIS:

- from menu: Raster / Extraction / Contour

- define output name path/to/output.shp

- alter GDAL call for single contour line at 900 m asl:


gdal_contour -fl 900 "path/to/dem_raster.asc" "path/to/output.shp"

- for removing small poplygons or lines add area or length field (attr table / field calc or vector / geometry / add geometry)

- query by length or area to deselect unwanted iso-lines

Finally, you can export the contours as KML and check it in Google Earth:

29 Sept 2012

Merging Dataframes by Partly Matching String

The latest posting by Tony Hirst sparked my attention because I was thinking about a very similar issue recently.

I was also fiddling around with agrep and adist until I realised that for this very issue matching of substrings is not as important as matching multiple words.. With this different approach I quite easily matched all but 3 countries.

See what I did:

## look up matches of one dataframe in another dataframe.
## the strings to be matched are comprised of 1 or more words 
## and seperated by white space.
## method: match strings that have the highest fraction of words that match up

d1 <- read.csv("http://s.telegraph.co.uk/graphics/conrad/PercentageUsingTheNet.csv", 
               header = T, sep = ",", encoding = "UTF-8")
d2 <- read.csv("http://www.iso.org/iso/country_names_and_code_elements_txt",
               header = T, sep = ";", encoding = "UTF-8")

## strings to be compared d1$ECONOMY and d2$Country.Name
mystr.1 <- as.character(d1$ECONOMY)
mystr.2 <- as.character(d2$Country.Name)
mystr.3 <- as.character(d2$ISO.3166.1.alpha.2.code)

## remove punctuation and multiple spaces
mystr.1 <- tolower(gsub("[^[:alnum:][:space:]]", "", mystr.1))
mystr.1 <- gsub("\\s+", " ", mystr.1)
mystr.2 <- tolower(gsub("[^[:alnum:][:space:]]", "", mystr.2))
mystr.2 <- gsub("\\s+", " ", mystr.2)

## function that finds matching words in string (words seperated by single space!)
n.wordshared <- function(x, y) {
    sum(!is.na(match(unlist(strsplit(x, " ")),
                     unlist(strsplit(y, " ")))
         )
        )
    }
## example
n.wordshared(x = "hello world", y = "good bye world")
## [1] 1

## function that calculates fraction of shared words
fr.wordshared <- function(x, y) {
                     n.wordshared(x, y) / (length(unique(unlist(strsplit(x, " "))))
                                           + length(unique(unlist(strsplit(y, " ")))))
                          }
## example
fr.wordshared(x = "hello world", y = "good bye world")
## [1] 0.2

mydf <- data.frame(str1 = mystr.1, mymatch = "", match.iso = "",
                   stringsAsFactors = F)

## now look up every element of string 1 in string 2
## and if there are matching words assign match to dataframe
for (i in 1:nrow(mydf)) {
   xx <- sapply(mystr.2, fr.wordshared, y = mystr.1[i])
   if (sum(xx) == 0) {
     mydf$mymatch[i] <- NA
     mydf$match.iso[i] <- NA
     } else {
     mydf$mymatch[i] <- paste(names(which(xx == max(xx))), collapse = "; ")
     mydf$match.iso[i] <- paste(mystr.3[as.integer(which(xx == max(xx)))], collapse = "; ")
   }
}

## see result
print(mydf)

## these are the multiple matches
(aa <- mydf[grep(";", mydf$mymatch), ])
##
##               str1                            mymatch match.iso
## 28 slovak republic czech republic; dominican republic    CZ; DO


## these were not matched
(bb <- mydf[is.na(mydf$mymatch), ])
##      str1 mymatch match.iso
##
## 61  russia     NA        NA
## 108  syria     NA        NA

Now, expanding on this concept by introduction of partial matching would most propably result in a 100% match...

28 Sept 2012

Reading and Text Mining a PDF-File in R

I just added this R-script that reads a PDF-file to R and does some text mining with it to my Github repo..

6 Sept 2012

Get Long-Term Climate Data from KNMI Climate Explorer

You can query global climate data from the KNMI Climate Explorer (the KNMI is the Royal Netherlands Metereological Institute) with R.

Here's a little example how I retreived data for my hometown Innsbruck, Austria and plotted annual total precipitation. You can choose station data by pointing at a map, by setting coordinates, etc.

Follow-Up: Making a Word Cloud for a Search Result from GScholar_Scraper_3.1

Here's a short follow-up on how to produce a word cloud for a search result from GScholarScraper_3.1:

# File-Name: GScholarScraper_3.1.R
# Date: 2012-08-22
# Author: Kay Cichini
# Email: kay.cichini@gmail.com
# Purpose: Scrape Google Scholar search result
# Packages used: XML
# Licence: CC BY-SA-NC
#
# Arguments:
# (1) input:
# A search string as used in Google Scholar search dialog
#
# (2) write:
# Logical, should a table be writen to user default directory?
# if TRUE ("T") a CSV-file with hyperlinks to the publications will be created.
#
# Difference to version 3:
# (3) added "since" argument - define year since when publications should be returned..
# defaults to 1900..
#
# (4) added "citation" argument - logical, if "0" citations are included
# defaults to "1" and no citations will be included..
# added field "YEAR" to output 
#
# Caveat: if a submitted search string gives more than 1000 hits there seem
# to be some problems (I guess I'm being stopped by Google for roboting the site..)
#
# And, there is an issue with this error message:
# > Error in htmlParse(URL): 
# > error in creating parser for http://scholar.google.com/scholar?q
# I haven't figured out his one yet.. most likely also a Google blocking mechanism..
# Reconnecting / new IP-address helps..


GScholar_Scraper <- function(input, since = 1900, write = F, citation = 1) {

    require(XML)

    # putting together the search-URL:
    URL <- paste("http://scholar.google.com/scholar?q=", input, "&as_sdt=1,5&as_vis=", 
                 citation, "&as_ylo=", since, sep = "")
    cat("\nThe URL used is: ", "\n----\n", paste("* ", "http://scholar.google.com/scholar?q=", input, "&as_sdt=1,5&as_vis=", 
                 citation, "&as_ylo=", since, " *", sep = ""))
    
    # get content and parse it:
    doc <- htmlParse(URL)
    
    # number of hits:
    h1 <- xpathSApply(doc, "//div[@id='gs_ab_md']", xmlValue)
    h2 <- strsplit(h1, " ")[[1]][2] 
    num <- as.integer(sub("[[:punct:]]", "", h2))
    cat("\n\nNumber of hits: ", num, "\n----\n", "If this number is far from the returned results\nsomething might have gone wrong..\n\n", sep = "")
    
    # If there are no results, stop and throw an error message:
    if (num == 0 | is.na(num)) {
        stop("\n\n...There is no result for the submitted search string!")
    }
    
    pages.max <- ceiling(num/100)
    
    # 'start' as used in URL:
    start <- 100 * 1:pages.max - 100
    
    # Collect URLs as list:
    URLs <- paste("http://scholar.google.com/scholar?start=", start, "&q=", input, 
                  "&num=100&as_sdt=1,5&as_vis=", citation, "&as_ylo=", since, sep = "")
    
    scraper_internal <- function(x) {
        
        doc <- htmlParse(x, encoding="UTF-8")
        
        # titles:
        tit <- xpathSApply(doc, "//h3[@class='gs_rt']", xmlValue)
        
        # publication:
        pub <- xpathSApply(doc, "//div[@class='gs_a']", xmlValue)
        
        # links:
        lin <- xpathSApply(doc, "//h3[@class='gs_rt']/a", xmlAttrs)
        
        # summaries are truncated, and thus wont be used..  
        # abst <- xpathSApply(doc, '//div[@class='gs_rs']', xmlValue)
        # ..to be extended for individual needs
        options(warn=(-1))
        dat <- data.frame(TITLES = tit, PUBLICATION = pub, 
                          YEAR = as.integer(gsub(".*\\s(\\d{4})\\s.*", "\\1", pub)),
                          LINKS = lin)
        options(warn=0)
        return(dat)
    }

    result <- do.call("rbind", lapply(URLs, scraper_internal))
    if (write == T) {
      result$LINKS <- paste("=Hyperlink(","\"", result$LINKS, "\"", ")", sep = "")
      write.table(result, "GScholar_Output.CSV", sep = ";", 
                  row.names = F, quote = F)
      shell.exec("GScholar_Output.CSV") 
      } else {
      return(result)
    }
}

# EXAMPLE:

input <- "allintitle:amphibian+diversity"
df <- GScholar_Scraper(input, since = 1980, citation = 1)

#install.packages("tm")
library(tm)

#install.packages("wordcloud")
library(wordcloud)

corpus <- Corpus(VectorSource(df$TITLES))
corpus <- tm_map(corpus, function(x)removeWords(x, c(stopwords(), "PDF", "B", "DOC", "HTML", "BOOK", "CITATION")))
corpus <- tm_map(corpus, removePunctuation)
tdm <- TermDocumentMatrix(corpus)
m <- as.matrix(tdm)
v <- sort(rowSums(m), decreasing = TRUE)
d <- data.frame(word = names(v), freq = v)

# remove numbers from strings:
d <- d[-grep("[0-9]", d$word), ]

# print wordcloud:
wordcloud(d$word, d$freq)





Labels:

Google Scholar


                                              ,
                                            

GScholarScraper()


                                              ,
                                            

R


                                              ,
                                            

Word-Cloud













5
Kommentare


                                        

                                        
                                      


25 Aug 2012



                                        
                                      





Toy Example with GScholarScraper_3.1






A commentator on my blog brought up this nice idea of how to use the GScholarScraper function for bibliometrics..

I altered the code a little bit which enables to set a year since when results should be returned and added a field to the output collecting the year of publication. With this you can simply do something like this:















input <- "intitle:metapopulation"
df <- GScholar_Scraper(input, since = 1980, citation = 1)
nrow(df)
hist(df$YEAR, xlab = "Year", 
     main = "Frequency of Publications with\n\"METAPOPULATION\" in Title")





Labels:

Google Scholar


                                              ,
                                            

GScholarScraper()


                                              ,
                                            

R













0
Kommentare











                                          
                                        

                                        
                                      


22 Aug 2012



                                        
                                      





Web-Scraper for Google Scholar Updated!






I have updated the Google Scholar Web-Scraper Function GScholarScaper_2 to GScholarScraper_3 (and GScholarScaper_3.1) as it was outdated due to changes in the Google Scholar html-code. The new script is more slender and faster. It returns a dataframe or optionally a CSV-file with the titles, authors, publications & links. Feel free to report bugs, etc.







Update 11-07-2013: bug fixes due to google scholar code changes - https://github.com/gimoya/theBioBucket-Archives/blob/master/R/Functions/GScholarScraper_3.2.R. Note that since lately your IP will be blocked by Google at about the 1000th search result (cumulated) - so there's not much fun when you want to do some extensive bibliometrics..





Labels:

Google Scholar


                                              ,
                                            

R


                                              ,
                                            

XML-Package













12
Kommentare











                                          
                                        

                                        
                                      


11 Aug 2012



                                        
                                      





Tcl/Tk GUI Example with Variable Input by User








I recently used R with GUI-elements for the first time and browsed through the available online resources, but I didn't quite find what I was searching for: The user should be able to put in some variables and call a function with a button. In the end I did it with a little help from SO. Here is the working example that I eventually plugged together:



  







Read more »





Labels:

R


                                              ,
                                            

tcltk













3
Kommentare











                                          
                                        

                                        
                                      


29 Jun 2012



                                        
                                      





Use IUCN API with R & XPath










Thanks to a posting on R-sig-eco mailing list I learned of the IUCN-API. Here's a simple example for what can be done with it (output as pdf is HERE):









Read more »





Labels:

GIS


                                              ,
                                            

IUCN


                                              ,
                                            

Maps


                                              ,
                                            

maptools


                                              ,
                                            

R


                                              ,
                                            

XPath













7
Kommentare











                                          
                                        

                                        
                                      


25 Jun 2012



                                        
                                      





Avoid Overplotting of Text in Ordination Diagram






Referring to a recent posting on r-sig-eco mailing list I'll add this example to theBioBucket:





























library(vegan)
library(vegan)
data(dune)
sol <- metaMDS(dune)
 
# use ordipointlabel -
# here is an example where I added cex according to species frequencies:
plot(sol, type = "n")
cex.lab = colSums(dune > 0) / nrow(dune) + 1
col.lab = rgb(0.2, 0.5, 0.4, alpha = 0.6)
ordipointlabel(sol, displ = "sp", col = col.lab, cex = cex.lab)
 
 
# you could also use pointLabel() from maptools package:
library(maptools)
x = as.vector(sol$species[,1])
y = as.vector(sol$species[,2])
w = row.names(sol$species)
 
plot(sol, type = "n")
points(sol, displ = "species", cex = 1, pch = 4, col = 3)
pointLabel(x, y, w, col = col.lab, cex = cex.lab)





Labels:

metaMDS


                                              ,
                                            

Ordination diagram


                                              ,
                                            

R













0
Kommentare











                                          
                                        

                                        
                                      


19 Jun 2012



                                        
                                      





A Wrapper Function for Instant Package Installation / Loading






Since library() and require() only accept input with length(input) = 1 it is necessary to make repeated calls - this can be quite annoying.. So, HERE is a little wrapper function for convenient package installation / loading. It installs packages if they are missing and loads them if there were not loaded yet. I have put it to my RProfile.site so I can conveniently install / load packages with only one call and am quite content with it..





Labels:

Packages


                                              ,
                                            

R













0
Kommentare











                                          
                                        

                                        
                                      


11 Jun 2012



                                        
                                      





FloraWeb Plant Species Report via R






For German-spoken users I added the function floraweb_scrape.R that allows you to conveniently collect species data and print to a PDF-file (see this example output). The function accesses data provided by the  web-site FloraWeb.de (BfN - Bundesministerium für Naturschutz).

You can use it as an interactive version (RTclTk) which I have put to a Github repository HERE. 



Preview: 














Labels:

FloraWeb


                                              ,
                                            

R


                                              ,
                                            

RCurl


                                              ,
                                            

XML-Package













0
Kommentare











                                          
                                        

                                        
                                      


14 May 2012



                                        
                                      





Source R-Script from Dropbox






A quick tip on how to source R-scripts from a Dropbox-account:



(1) Upload the script.. 



(2) Get link with the "get link" option. The link should look like "https://www.dropbox.com/s/XXXXXX/yourscript.R"..



(3) Grab this part "XXXXXX/yourscript.R" and paste it to "http://dl.dropbox.com/s/"..



(4) the final URL that can be sourced:

source("http://dl.dropbox.com/s/XXXXXX/yourscript.R")
..an example with this script stored at my Dropbox account:

source("http://dl.dropbox.com/s/c18lcwnnrodsevt/test_dropbox_source.R")


EDIT, March 2013:

This method is not working anymore. You can use the following approach instead:



library(RCurl)
setwd(tempdir())

destfile = "test.txt"
x = getBinaryURL("https://dl.dropbox.com/u/68286640/test_dropbox_source.R", followlocation = TRUE, ssl.verifypeer = FALSE)
writeBin(x, destfile, useBytes = TRUE)
source(paste(tempdir(), "/test.txt", sep = ""))

# remove files from tempdir:
unlink(dir())






Labels:

Dropbox


                                              ,
                                            

R













3
Kommentare











                                          

                                        

                                        
                                      


2 May 2012



                                        
                                      





knitr-Example: Use World Bank Data to Generate Report for Threatened Bird Species






I'll use the below script that retrieves data for threatened bird species from the World Bank via its API and does some processing, plotting and analysis. There is a package (WDI) that allows you to access the data easily.



















# world bank indicators for species - 
# I'll check bird species:
code <- as.character(WDIsearch("bird")[1,1])
bird_data <- WDI(country="all", indicator=code, start=2010, end=2012)

# remove NAs and select values in the range 50 - 1000:
bird_data_sub <- bird_data[!is.na(bird_data$EN.BIR.THRD.NO)&
                           bird_data$EN.BIR.THRD.NO < 1000&
                           bird_data$EN.BIR.THRD.NO > 50, ]

# change in numbers across years 2010 and 2011:
change.no <- aggregate(EN.BIR.THRD.NO ~ country, diff,
                       data = bird_data_sub)
# plot:
par(mar = c(3, 3, 5, 1))
plot(x = change.no[,2], y = 1:nrow(change.no),
     xlim = c(-12, 12), xlab = "", ylab = "",
     yaxt = "n")
abline(v = 0, lty = 2, col = "grey80")
title(main = "Change in Threatened Bird Species in\nCountries with Rich Avifauna (>50)")
text(y = 1:nrow(change.no), 
     x = -2, adj = 1,
     labels = change.no$country)
segments(x0 = 0, y0 = 1:nrow(change.no),
         x1 = change.no[, 2], y1 =  1:nrow(change.no))

# test hypothesis that probability of species decrease is
# equal to probability of increase:
binom.test(sum(change.no < 0), sum(change.no != 0))
For generating the report you can source the script from dropbox.com and stitch it in this fashion: 

stitch("http://dl.dropbox.com/s/ga0qbk1o17n17jj/Change_threatened_species.R")
..this is one line of code - can you dig it?..

BTW, for simplicity I use knitr::stitch with its default template...



You should get something like THIS PDF.



EDIT, MARCH 2013

OUTDATED! you can use this approach instead:



library(knitr); library(RCurl); library(WDI)

destfile = "script.txt"
x = getBinaryURL("https://dl.dropbox.com/s/ga0qbk1o17n17jj/Change_threatened_species.R", followlocation = TRUE, ssl.verifypeer = FALSE)
writeBin(x, destfile, useBytes = TRUE)
source(paste(tempdir(), "/script.txt", sep = ""))

stitch(paste(tempdir(), "/script.txt", sep = ""))






Labels:

knitr


                                              ,
                                            

R


                                              ,
                                            

Reproducible research


                                              ,
                                            

stitch()


                                              ,
                                            

WDI













4
Kommentare















Playing with knitr: Create Report with Dynamic List






Here is a little toy example using knitr, LaTeX/MiKTeX and Google Docs.

Say you had a list on Google Docs (say a list of attendants) and you want to print a report with it..

Then see this example using this Rnw-file and the output...



make the tex-file with:

library(knitr)
knit("knitr_list_of_attendants.Rnw")

..then compile the tex-file with MiKTeX.



or with this shortcut:

knit2pdf("knitr_list_of_attendants.Rnw")
browseURL("knitr_list_of_attendants.pdf") 










Labels:

Google Docs


                                              ,
                                            

knitr


                                              ,
                                            

R


                                              ,
                                            

Reproducible research














                                                  1 
                                                  Kommentare











                                          

                                        

                                        
                                      


1 May 2012



                                        
                                      





Quick Tip: Replace Values in Dataframe on Condition with Random Numbers






This one took me some time - though, in fact it is plain simple:

> options(scipen=999)
> (my_df <- data.frame(matrix(sample(c(0,1), 100, replace = T), 10, 10)))
   X1 X2 X3 X4 X5 X6 X7 X8 X9 X10
1   0  0  1  0  1  1  1  1  0   1
2   0  0  1  1  1  0  0  0  0   0
3   0  1  1  0  0  1  1  0  0   1
4   1  1  0  0  0  0  0  0  0   0
5   0  0  1  1  1  0  0  1  1   1
6   0  0  1  1  0  0  1  1  0   1
7   0  0  1  1  1  0  1  1  1   1
8   1  1  1  0  0  0  0  1  1   0
9   0  0  0  0  1  0  1  0  1   0
10  1  1  1  1  1  0  1  1  0   1
> my_df[my_df == 0] <- runif(sum(my_df==0), 0, 0.001)
> my_df
         X1       X2       X3       X4       X5       X6       X7       X8
1  0.000268 0.000926 1.000000 2.00e-05 1.000000 1.00e+00 1.00e+00 1.00e+00
2  0.000531 0.000882 1.000000 1.00e+00 1.000000 4.66e-04 3.96e-04 6.70e-04
3  0.000785 1.000000 1.000000 5.03e-04 0.000164 1.00e+00 1.00e+00 2.98e-04
4  1.000000 1.000000 0.000336 8.71e-04 0.000770 7.44e-05 6.49e-05 1.01e-04
5  0.000168 0.000674 1.000000 1.00e+00 1.000000 6.49e-04 2.26e-04 1.00e+00
6  0.000404 0.000950 1.000000 1.00e+00 0.000735 7.59e-04 1.00e+00 1.00e+00
7  0.000472 0.000516 1.000000 1.00e+00 1.000000 1.37e-04 1.00e+00 1.00e+00
8  1.000000 1.000000 1.000000 6.30e-06 0.000972 3.97e-04 5.46e-05 1.00e+00
9  0.000868 0.000577 0.000347 7.21e-05 1.000000 2.25e-04 1.00e+00 7.19e-05
10 1.000000 1.000000 1.000000 1.00e+00 1.000000 5.80e-05 1.00e+00 1.00e+00
         X9      X10
1  0.000880 1.00e+00
2  0.000754 7.99e-04
3  0.000817 1.00e+00
4  0.000982 7.85e-04
5  1.000000 1.00e+00
6  0.000104 1.00e+00
7  1.000000 1.00e+00
8  1.000000 9.43e-06
9  1.000000 7.79e-04
10 0.000099 1.00e+00







Labels:

R













0
Kommentare











                                          
                                        

                                        
                                      


25 Apr 2012



                                        
                                      





Reproducible Research: Running odfWeave with 7-zip






odfWeave is an R-package that is used for making dynamic reports by Sweave processing of Open Document Format (ODF) files. For anyone new to report generation and lacking knowledge of markup languages this might be a good starting point or even a true alternative to sweave / LATEX and others.



Now, anyone who recently tried to install the required zipping program for odfWeave might have noticed that there are currently no info-zip executables available (zip and unzip by info-zip software are suggested in the odfWeave manual). There are several other free zipping programs - but if you use these the default syntax for odfWeave changes. Looking into the internals it is revealed that the OS command specified for running the zipping program has to be adapted. There are some postings on the R-help mailing list concerning these topic, but none of them worked for me. After some trial and error I managed to get around this problem by using 7-zip with an adapted syntax and will share this here:




# write an in-file and save it to a folder:
dir()
[1] "Example_1_in.odt"

# testing the :
system("\"C:\\Program Files\\7-Zip\\7z.exe\" t -tzip Example_1_in.odt")

7-Zip 9.20  Copyright (c) 1999-2010 Igor Pavlov  2010-11-18

Processing archive: Example_1_in.odt

Testing     mimetype
Testing     Configurations2\statusbar
Testing     Configurations2\accelerator\current.xml
Testing     Configurations2\floater
Testing     Configurations2\popupmenu
Testing     Configurations2\progressbar
Testing     Configurations2\menubar
Testing     Configurations2\toolbar
Testing     Configurations2\images\Bitmaps
Testing     content.xml
Testing     manifest.rdf
Testing     styles.xml
Testing     meta.xml
Testing     Thumbnails\thumbnail.png
Testing     settings.xml
Testing     META-INF\manifest.xml

Everything is Ok

Folders: 7
Files: 9
Size:       30139
Compressed: 9572

# setting up cmd prompt:
library(odfWeave)
ctrl <- odfWeaveControl(zipCmd =
           c("\"C:\\Program Files\\7-Zip\\7z.exe\" a $$file$$",
             "\"C:\\Program Files\\7-Zip\\7z.exe\" x -tzip $$file$$"))
# running:
odfWeave("Example_1_in.odt", "Example_1_out.odt", control = ctrl)

  Copying  Example_1_in.odt
  Setting wd to  C:\Users\Kay\AppData\Local\Temp\RtmpghYef5/odfWeave2223195249
  Unzipping ODF file using "C:\Program Files\7-Zip\7z.exe" x -tzip "Example_1_in.odt"

7-Zip 9.20  Copyright (c) 1999-2010 Igor Pavlov  2010-11-18

Processing archive: Example_1_in.odt

Extracting  mimetype
Extracting  Configurations2\statusbar
Extracting  Configurations2\accelerator\current.xml
Extracting  Configurations2\floater
Extracting  Configurations2\popupmenu
Extracting  Configurations2\progressbar
Extracting  Configurations2\menubar
Extracting  Configurations2\toolbar
Extracting  Configurations2\images\Bitmaps
Extracting  content.xml
Extracting  manifest.rdf
Extracting  styles.xml
Extracting  meta.xml
Extracting  Thumbnails\thumbnail.png
Extracting  settings.xml
Extracting  META-INF\manifest.xml

Everything is Ok

Folders: 7
Files: 9
Size:       30139
Compressed: 9572

  Removing  Example_1_in.odt
  Creating a Pictures directory

  Pre-processing the contents
  Sweaving  content.Rnw

  Writing to file content_1.xml
  Processing code chunks ...

  'content_1.xml' has been Sweaved

  Removing content.xml

  Post-processing the contents
  Removing content.Rnw
  Removing styles.xml
  Renaming styles_2.xml to styles.xml
  Removing manifest.xml
  Renaming manifest_2.xml to manifest.xml
  Removing extra files

  Packaging file using "C:\Program Files\7-Zip\7z.exe" a "Example_1_in.odt"

7-Zip 9.20  Copyright (c) 1999-2010 Igor Pavlov  2010-11-18
Scanning

Creating archive Example_1_in.odt

Compressing  Configurations2\accelerator\current.xml
Compressing  content.xml
Compressing  manifest.rdf
Compressing  META-INF\manifest.xml
Compressing  meta.xml
Compressing  mimetype
Compressing  settings.xml
Compressing  styles.xml
Compressing  Thumbnails\thumbnail.png

Everything is Ok
  Copying  Example_1_in.odt
  Resetting wd
  Removing  C:\Users\Kay\AppData\Local\Temp\RtmpghYef5/odfWeave2223195249

  Done

# see the result:
dir()
[1] "Example_1_in.odt"  "Example_1_out.odt"






Labels:

ODF


                                              ,
                                            

odf Weave


                                              ,
                                            

R


                                              ,
                                            

Reproducible research













2
Kommentare











                                          
                                        

                                        
                                      


20 Apr 2012



                                        
                                      





Reproducible Research: Export Regression Table to MS Word






Here's a quick tip for anyone wishing to export results, say a regression table, from R to MS Word:





Read more »





Labels:

MS-Word


                                              ,
                                            

R


                                              ,
                                            

Regression


                                              ,
                                            

Reproducible research













11
Kommentare











                                          
                                        

                                        
                                      


6 Apr 2012



                                        
                                      





R-Bloggers' Web-Presence






We love them, we hate them: RANKINGS!



Rankings are an inevitable tool to keep the human rat race going. In this regard I'll pick up my last two posts (HERE & HERE) and have some fun with it by using it to analyse R-Bloggers' web presence. I will use number of hits in Google Search as an indicator.



I searched for URLs like this: https://www.google.com/search?q="http://www.twotorials.com" - meaning that only the exact blog-URL is searched.




 Blogs  NoHits   
 http://google-opensource.blogspot.com  82300  
 http://www.programmingr.com  73500  
 http://googleresearch.blogspot.com  58000  
 http://dirk.eddelbuettel.com  53000  
 http://borasky-research.net  33100  
 http://casoilresource.lawr.ucdavis.edu  32500  
 http://andrewgelman.com  30000  
 http://yihui.name  29600  
 http://xianblog.wordpress.com  27900  
 http://nsaunders.wordpress.com  27600  
 http://chem-bla-ics.blogspot.com  26600  
 http://plindenbaum.blogspot.com  24600  
 http://blog.ouseful.info  24300  
 http://www.vcasmo.com  24200  
 http://yz.mit.edu  23500  
 http://romainfrancois.blog.free.fr  22700  
 http://blog.revolutionanalytics.com  21000  
 http://robjhyndman.com  18400  
 http://freakonometrics.blog.free.fr  16100  
 http://perfdynamics.blogspot.com  15400  
 http://www.stubbornmule.net  14800  
 http://zoonek.free.fr  14800  
 http://jackman.stanford.edu  13900  
 http://www.bytemining.com  13700  
 http://learnr.wordpress.com  12600  
 http://tommy.chheng.com  12200  
 http://mazamascience.com  12000  
 http://www.investuotojas.eu  11500  
 http://www.r-statistics.com  11300  
 http://www.franklincenterhq.org  10800  
 http://gettinggeneticsdone.blogspot.com  10700  
 http://mpastell.com  9930  
 http://pineda-krch.com  9780  
 http://blog.saush.com  9220  
 http://www.premiersoccerstats.com  8950  
 http://developmentality.wordpress.com  7250  
 http://www.dataspora.com  7200  
 http://blog.hiremebecauseimsmart.com  7050  
 http://isomorphismes.tumblr.com  7040  
 http://www.mathfinance.cn  6930  
 http://blog.nguyenvq.com  6150  
 http://www.drewconway.com  5970  
 http://www.carlboettiger.info  5520  
 http://www.statisticsblog.com  5110  
 http://www.decisionsciencenews.com  4950  
 http://www.r-chart.com  4810  
 http://chartsgraphs.wordpress.com  4480  
 http://www.portfolioprobe.com  4410  
 http://procomun.wordpress.com  4330  
 http://jeromyanglim.blogspot.com  4080  
 http://spatialanalysis.co.uk  4080  
 http://www.theresearchkitchen.com  4080  
 http://www.forex-bloggers.com  4070  
 https://www.rmetrics.org  4050  
 http://princeofslides.blogspot.com  3900  
 http://www.cybaea.net  3740  
 http://www.cerebralmastication.com  3710  
 http://ygc.name  3670  
 http://ryouready.wordpress.com  3450  
 http://jeffreybreen.wordpress.com  3410  
 http://systematicinvestor.wordpress.com  3400  
 http://sgsong.blogspot.com  3310  
 http://industrialengineertools.blogspot.com  3290  
 http://www.r-tutor.com  3270  
 http://fishlab.ucdavis.edu  3270  
 http://ggorjan.blogspot.com  3250  
 http://blog.ynada.com  3220  
 http://farmacokratia.blogspot.com  3170  
 http://4dpiecharts.com  3130  
 http://heuristically.wordpress.com  3040  
 http://blog.rtwilson.com  2910  
 http://www.wekaleamstudios.co.uk  2890  
 http://www.dataists.com  2840  
 http://ikanb.wordpress.com  2750  
 http://shape-of-code.coding-guidelines.com  2730  
 http://onertipaday.blogspot.com  2710  
 http://blog.fosstrading.com  2700  
 http://blog.echen.me  2690  
 http://www.theusrus.de  2670  
 http://cloudnumbers.com  2630  
 http://paulbutler.org  2620  
 http://biostatmatt.com  2460  
 http://www.johnmyleswhite.com  2430  
 http://dataninja.wordpress.com  2360  
 http://realizationsinbiostatistics.blogspot.com  2340  
 http://statisfaction.wordpress.com  2300  
 http://uxblog.idvsolutions.com  2250  
 http://timelyportfolio.blogspot.com  2210  
 http://radfordneal.wordpress.com  2200  
 http://sas-and-r.blogspot.com  2200  
 http://pairach.com  2110  
 http://yusung.blogspot.com  2050  
 http://blog.flacso.edu.mx  2010  
 http://www.rensenieuwenhuis.nl  2000  
 http://michaeldhealy.com  1990  
 http://freigeist.devmag.net  1950  
 http://www.fernandohrosa.com.br  1920  
 http://statbandit.wordpress.com  1870  
 http://www.win-vector.com  1840  
 http://lukemiller.org  1830  
 http://ropensci.org  1720  
 http://www.eggwall.com  1650  
 http://benmazzotta.wordpress.com  1620  
 http://bms.zeugner.eu  1610  
 http://cartesianfaith.wordpress.com  1580  
 http://linkedscience.org  1570  
 http://stevemosher.wordpress.com  1550  
 http://intelligenttradingtech.blogspot.com  1520  
 http://www.imachordata.com  1480  
 http://blog.diegovalle.net  1470  
 http://jermdemo.blogspot.com  1430  
 http://nortalktoowise.com  1420  
 http://ekonometrics.blogspot.com  1340  
 http://digitheadslabnotebook.blogspot.com  1320  
 http://flyordie.sin.khk.be  1310  
 http://schamberlain.github.com  1230  
 http://gribblelab.org  1180  
 http://www.quantf.com  1130  
 http://offensivepolitics.net  1020  
 http://www.markmfredrickson.com  981  
 http://blog.mckuhn.de  948  
 http://erehweb.wordpress.com  889  
 http://confounding.net  886  
 http://simplystatistics.tumblr.com  875  
 http://www.babelgraph.org  859  
 http://csgillespie.wordpress.com  857  
 http://joewheatley.net  844  
 http://helmingstay.blogspot.com  843  
 http://theaverageinvestor.wordpress.com  825  
 http://quantitative-ecology.blogspot.com  785  
 http://zvfak.blogspot.com  776  
 http://ucfagls.wordpress.com  766  
 http://opendatagroup.com  760  
 http://cameron.bracken.bz  740  
 http://rtutorialseries.blogspot.com  738  
 http://opencpu.org  708  
 http://novicemetrics.blogspot.com  700  
 http://lamages.blogspot.com  680  
 http://nir-quimiometria.blogspot.com  679  
 http://tonybreyal.wordpress.com  677  
 http://brokeringclosure.wordpress.com  658  
 http://socialdatablog.com  643  
 http://dancingeconomist.blogspot.com  629  
 http://www.rtexttools.com  603  
 http://danganothererror.wordpress.com  589  
 http://thebiobucket.blogspot.com  567  
 http://holtmeier.de  531  
 http://val-systems.blogspot.com  519  
 http://thelogcabin.wordpress.com  489  
 http://dcemri.blogspot.com  484  
 http://rdatamining.wordpress.com  477  
 http://bridgewater.wordpress.com  460  
 http://www.rcasts.com  444  
 http://dsparks.wordpress.com  436  
 http://pr.cloudst.at  422  
 http://polstat.org  409  
 http://www.compmath.com  401  
 http://techno-realism.blogspot.com  399  
 http://www.backsidesmack.com  395  
 http://geotheory.org  393  
 http://miraisolutions.wordpress.com  367  
 http://econometricsense.blogspot.com  352  
 http://blog.binfalse.de  344  
 http://rforcancer.drupalgardens.com  317  
 http://blog.rstudio.org  316  
 http://mcfromnz.wordpress.com  309  
 http://www.quantumforest.com  309  
 http://blog.quanttrader.org  303  
 http://chrisladroue.com  293  
 http://www.michaelbommarito.com  289  
 http://procrun.com  280  
 http://mikeksmith.posterous.com  279  
 http://bio7.org  278  
 http://kbroman.wordpress.com  278  
 http://martynplummer.wordpress.com  272  
 http://bryer.org  268  
 http://www.funjackals.com  265  
 http://www.harlan.harris.name  252  
 http://www.milktrader.net  248  
 http://www.surefoss.org  241  
 http://rigorousanalytics.blogspot.com  231  
 http://www.jameskeirstead.ca  229  
 http://programming-r-pro-bro.blogspot.com  225  
 http://plausibel.blogspot.com  224  
 http://statistic-on-air.blogspot.com  217  
 http://mintgene.wordpress.com  212  
 http://moderntoolmaking.blogspot.com  205  
 http://quantitativeecology.blogspot.com  199  
 http://www.sigmafield.org  199  
 http://www.ancienteco.com  194  
 http://worldofrcraft.blogspot.com  191  
 http://rappster.wordpress.com  190  
 http://stotastic.com  189  
 http://evolvingspaces.blogspot.com  184  
 http://strugglingthroughproblems.blogspot.com  166  
 http://sharpstatistics.co.uk  161  
 http://leftcensored.skepsi.net  160  
 http://omegahat.wordpress.com  156  
 http://drunks-and-lampposts.com  155  
 http://amathew.com  152  
 http://onlinelabor.blogspot.com  147  
 http://johnramey.net  144  
 http://gossetsstudent.wordpress.com  138  
 http://tomhopper.wordpress.com  135  
 http://ggobi.blogspot.com  134  
 http://blog.fellstat.com  131  
 http://www.openanalytics.eu  130  
 http://www.numbertheory.nl  127  
 http://stats.blogoverflow.com  127  
 http://the-praise-of-insects.blogspot.com  122  
 http://lpenz.github.com  118  
 http://christophergandrud.blogspot.com  118  
 http://f.giorlando.org  112  
 http://bayesianbiologist.com  110  
 http://www.graphoftheweek.org  109  
 http://oneliner.soma20.com  109  
 http://inundata.org  107  
 http://geokook.wordpress.com  104  
 http://blog.datapunks.com  102  
 http://eranraviv.com  102  
 http://eranraviv.com  102  
 http://www.compbiome.com  101  
 http://www.techpolicy.ca  99  
 http://www.psychwire.co.uk  97  
 http://blog.carlislerainey.com  93  
 http://vasishth-statistics.blogspot.com  93  
 http://www.statsravingmad.com  93  
 http://using-r-project.blogspot.com  93  
 http://www.nikhilgopal.com  92  
 http://thedatamonkey.blogspot.com  92  
 http://jeffreyhorner.tumblr.com  90  
 http://menugget.blogspot.com  88  
 http://www.twotorials.com  88  
 http://dataexcursions.wordpress.com  84  
 http://viksalgorithms.blogspot.com  83  
 http://exploringdatablog.blogspot.com  81  
 http://sachaepskamp.com  81  
 http://aphysicistinwallstreet.blogspot.com  77  
 http://lastresortsoftware.blogspot.com  75  
 http://www.nomad.priv.at  72  
 http://applyr.blogspot.com  71  
 http://www.knowledgediscovery.jp  71  
 http://weitaiyun.blogspot.com  71  
 http://xmphforex.wordpress.com  71  
 http://statsadventure.blogspot.com  70  
 http://davenportspatialanalytics.squarespace.com  70  
 http://anandram.wordpress.com  69  
 http://rpint.wordpress.com  68  
 http://datadebrief.blogspot.com  66  
 http://blog.cloudstat.org  64  
 http://www.r-podcast.org  64  
 http://rmkrug.wordpress.com  62  
 http://denishaine.wordpress.com  61  
 http://expansed.com  58  
 http://r.andrewredd.us  57  
 http://isseing333.blogspot.com  57  
 http://solomonmessing.wordpress.com  57  
 http://rtricks.wordpress.com  57  
 http://anrprogrammer.wordpress.com  56  
 http://arungaikwad.wordpress.com  56  
 http://geolabs.wordpress.com  55  
 http://lookingatdata.blogspot.com  55  
 http://factbased.blogspot.com  54  
 http://severity.blogspot.com  54  
 http://swordofcrom.wordpress.com  53  
 http://librestats.wordpress.com  51  
 http://marcinkula.wordpress.com  51  
 http://gsoc2010r.wordpress.com  47  
 http://psyccomputing.blogspot.com  46  
 http://fabiomarroni.wordpress.com  45  
 http://jedifran.com  45  
 http://alstatr.blogspot.com  43  
 http://r-video-tutorial.blogspot.com  42  
 http://alexfarquhar.posterous.com  40  
 http://bmb-common.blogspot.com  40  
 http://rdataviz.wordpress.com  40  
 http://mypapertrades.blogspot.com  38  
 http://pitchrx.blogspot.com  38  
 http://simonmueller.net  38  
 http://statisfactions.wordpress.com  37  
 http://nzprimarysectortrade.wordpress.com  36  
 http://seanmulcahy.blogspot.com  36  
 http://www.speakingstatistically.com  35  
 http://joshpaulson.wordpress.com  34  
 http://learningrbasic.blogspot.com  34  
 http://mockquant.blogspot.com  33  
 http://costaleconomist.blogspot.com  32  
 http://rsnippets.blogspot.com  31  
 http://statmethods.wordpress.com  29  
 http://aviadklein.wordpress.com  28  
 http://obeautifulcode.com  28  
 http://blog.cloudst.at  24  
 http://rstats.posterous.com  23  
 http://notebookonthewebs.tumblr.com  22  
 http://0utlier.blogspot.com  21  
 http://gjkerns.github.com  21  
 http://eigensomething.blogspot.com  10  
 http://brocktibert.wordpress.com  9  
 http://toddjobe.blogspot.com  9  
 http://mickeymousemodels.blogspot.com  9  
 http://forgetfulfunctor.blogspot.com  9  
 http://rocknrblog.wordpress.com  9  
 http://dmbates.blogspot.com  8  
 http://blog.nextbiomotif.com  8  
 http://indiacrunchin.wordpress.com  8  
 http://blog.trenthauck.com  8  
 http://mikescnc.blogspot.com  8  
 http://jeroldhaas.blogspot.com  8  
 http://tlevine.tumblr.com  8  
 http://empty-moon-9726.heroku.com  8  
 http://www.proc-x.com  7  
 http://jointposterior.blogspot.com  7  
 http://gastonsanchez.wordpress.com  7  
 http://mlt-thinks.blogspot.com  7  
 http://rstats.wordpress.com  7  
 http://playingwithr.blogspot.com  7  
 http://scottmutchler.blogspot.com  6  
 http://iamdata.wordpress.com  6  
 http://sfchaos.blogspot.com  6  
 http://nightlordtw.wordpress.com  5  
 http://pleasepasstheroc.blogspot.com  5  
 http://wiekvoet.blogspot.com  5  
 http://d7.stattler.com  4  
 http://yetanotherrblog.blogspot.com  4  
 http://blog.iwanluijks.nl:80  3  
 https://rlearner.wordpress.com  3  
 http://margintale.blogspot.com  1  


When checking the results manually I discovered slight deviations in the numbers and admittedly have no clue why this is.. Sorry if any blog is under- overrepresented due to such an error - please report!



Here is the R-script:



require(XML)
library(stringr)
library(RCurl)
library(xtable)

GoogleHits.1 <- function(input)
   {
    url <- paste("https://www.google.com/search?q=\"",
                 input, "\"", sep = "")
 
    CAINFO = paste(system.file(package="RCurl"), "/CurlSSL/ca-bundle.crt", sep = "")
    script <- getURL(url, followlocation = TRUE, cainfo = CAINFO)
    doc <- htmlParse(script)
    res <- xpathSApply(doc, "//div[@id='subform_ctrl']/*", xmlValue)[[2]]
    return(as.integer(gsub("[^0-9]", "", res)))
   }

# Example:
GoogleHits.1("R%Statistical%Software")

###################### Begin get r-blogger's URLs: ###########################################
# get blogger urls with XML:
script <- getURL("www.r-bloggers.com")
doc <- htmlParse(script)
li <- getNodeSet(doc, "//ul[@class='xoxo blogroll']//a")
urls <- sapply(li, xmlGetAttr, "href")

# extract sensible blog urls:
# get ids for those with only 2 slashes (no 3rd in the end):
id <- which(nchar(gsub("[^/]", "", urls )) == 2)
slash_2 <- urls[id]

# find position of 3rd slash occurrence in strings:
slash_stop <- unlist(lapply(str_locate_all(urls, "/"),"[[", 3))
slash_3 <- substring(urls, first = 1, last = slash_stop - 1)

# replace the ones with 2 slashes:
blogs <- slash_3; blogs[id] <- slash_2

# dismiss:
blogs <- blogs[blogs != "http://domain"]
###################### End get r-blogger's URLs: #############################

###################### Begin Google Search: ##################################
# with lapply google mocks about roboting the site..
# I'm blocked on the 300th recursion..
# unlist(lapply(blogs, GoogleHits.1))

# try splitting, doesn't work (blocked the same as before)
res1 <- unlist(lapply(blogs[1:170], GoogleHits.1))
res2 <- unlist(lapply(blogs[171:334], GoogleHits.1))

# try to do it in 2 sessions (saving first result), or manually re-connnect host before second try:
df1 <- data.frame(Blogs = blogs[1:170], NoHits = res1, row.names = NULL)
save(df1, file = "df1.R")
load("df1.RData"); unlink("df1.RData")

# second run:
df2 <- data.frame(Blogs = blogs[171:334], NoHits = res2, row.names = NULL)

# bind dfs, sort by NoHits:
finres <- as.data.frame(rbind(df1, df2)); finres$Blogs <- as.character(finres$Blogs)
(finres <- finres[order(finres$NoHits, decreasing = T), ])

htmltab <- xtable(finres)
print(htmltab, type = "html", include.rownames=FALSE, file = "Bloggers.Google.Hits.htm")
###################### End Google Search #####################################

###################### Begin Plot: ###########################################
pdf("RBloggersWebPresence.pdf")
par(mar = c(4.5, 4.5, 3, 2), ylog = F)
plot(finres$NoHits, cex = 0.5, col = 3, 
     ylab = "No. of Hits in Google Search",
     xlab = "Blogs", log = "y")
set.seed(19)
rid <- sample(13:nrow(finres), 15)
text(x = rid, y = finres$NoHits[rid], 
     labels = finres$Blogs[rid],
     cex = 0.75, srt = 90, pos = 4, offset = -1) 
title(main = "R-Bloggers' Web Presence")
dev.off()
###################### End Plot ##############################################






Labels:

R


                                              ,
                                            

XML-Package


                                              ,
                                            

xtable()













5
Kommentare











                                          
                                        

                                        
                                      


5 Apr 2012



                                        
                                      





A Little Web Scraping Exercise with XML-Package






Some months ago I posted an example of how to get the links of the contributing blogs on the R-Blogger site. I used readLines() and did some string processing using regular expressions.



With package XML this can be drastically shortened - 

see this:

# get blogger urls with XML:
library(RCurl)
library(XML)
script <- getURL("www.r-bloggers.com")
doc <- htmlParse(script)
li <- getNodeSet(doc, "//ul[@class='xoxo blogroll']//a")
urls <- sapply(li, xmlGetAttr, "href")
With only a few lines of code this gives the same result as in the original post!  Here I will also process the urls for retrieving links to each blog's start page: 

# get ids for those with only 2 slashes (no 3rd in the end):
id <- which(nchar(gsub("[^/]", "", urls )) == 2)
slash_2 <- urls[id]

# find position of 3rd slash occurrence in strings:
slash_stop <- unlist(lapply(str_locate_all(urls, "/"),"[[", 3))
slash_3 <- substring(urls, first = 1, last = slash_stop - 1)

# final result, replace the ones with 2 slashes,
# which are lacking in slash_3:
blogs <- slash_3; blogs[id] <- slash_2
p.s.: Thanks to Vincent Zoonekynd for helping out with the XML syntax.





Labels:

R


                                              ,
                                            

Web Scraping


                                              ,
                                            

XML-Package













3
Kommentare











                                          
                                        

                                        
                                      


28 Mar 2012



                                        
                                      





Applying Same Changes  to Multiple Dataframes






How to apply the same changes to several dataframes and

save them to CSV: 



# a dataframe
a <- data.frame(x = 1:3, y = 4:6)

# make a list of several dataframes, then apply function (change column names, e.g.):
my.list <- list(a, a)
my.list <- lapply(my.list, function(x) {names(x) <- c("a", "b") ; return(x)})

# save dfs to csv with similar lapply-call:
n <- 1:length(my.list)
lapply(n, function(ni) {
               write.table(file = paste(ni, ".csv", sep = ""), 
               my.list[ni], sep = ";", row.names = F)
               }
       )


Edit:



I'll extend this to a script that reads several files from a directory, applies changes to the files in the same fashion and finally saves files back to the directory (as HERE)



# clean up
rm(list = ls())
setwd(tempdir())
unlink(dir(tempdir()))

# create some files in tempdir:
a <- data.frame(x = 1:3, y = 4:6)
b <- data.frame(x = 10:13, y = 14:15)
write.csv(a, "file1.csv", row.names = F)
write.csv(b, "file2.csv", row.names = F)

# now read all files to list:
mycsv = dir(pattern=".csv")

n <- length(mycsv)
mylist <- vector("list", n)

for(i in 1:n) mylist[[i]] <- read.csv(mycsv[i])

# now change something in all dfs in list:
mylist <- lapply(mylist, function(x) {names(x) <- c("a", "b") ; return(x)})

# then save back dfs:# then save back dfs:
for(i in 1:n) {
   write.csv(file = sub(".csv", "_new.csv", mycsv[i]),
             mylist[i], row.names = F)
}






Labels:

R













5
Kommentare











                                          
                                        

                                        
                                      


26 Mar 2012



                                        
                                      





How to Extract Citation from a Body of Text






Say, you have a text and you want to retrieve the cited names and years of publication. You wouldn't want to this by hand, wouldn't you?



Try the following approach:

(the text sample comes from THIS freely available publication)



library(stringr)

(txt <- readLines("http://dl.dropbox.com/u/68286640/Test_Doc.txt"))
[1] "1  Introduction"                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                             
[2] ""                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            
[3] "Climate projections of the Intergovernmental Panel on Climate Change (IPCC) forecast a general increase of seasonal temperatures in the present century across the temperate zone, aggravated by decreasing amounts of summer rainfall in certain regions at lower latitudes (Christensen et al. 2007). These changes imply serious ecological consequences, especially in biome transition zones (Fischlin et al. 2007). Due to their economic importance, as well as their major contribution to supporting, regulating and cultural ecosystem services, predicted changes and shifts in temperate forest ecosystems receive wide public attention. It’s no surprise that dominant forest tree species are frequently modelled in bioclimatic impact studies (e.g., Sykes et al. 1996; Iverson, Prasad 2001; Rehfeldt et al. 2003; Ohlemüller et al. 2006). However, most studies focus on continental-scale effects of climate change, using low resolution climatic and species distribution data. More detailed regional studies focussing on specific endangered regions are also needed (Benito Garzón et al. 2008). Such regional studies have already been prepared for several European regions, including the Swiss Alps (Bolliger et al. 2000), the British Isles (Berry et al. 2002) and the Iberian Peninsula (Benito Garzón et al. 2008)."                                                                                                                    
[4] ""                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            
[5] "In this study, we aim to (1) identify the limiting macroclimatic factors and to (2) predict the future boundaries of beech (Fagus sylvatica L.) and sessile oak (Quercus petraea (Mattuschka) Liebl.) forests in a region highly vulnerable to climatic extremes. Both tree species form extensive zonal forests throughout Central Europe and reach their low altitude/low latitude, xeric (Mátyás et al. 2009) distributional limits within the forest-steppe biome transition zone of Hungary. The rise of temperature, and especially summer rainfall deficits expected for the twenty-first century, may strongly affect both species. Nevertheless, regarding the potential future distribution of these important forest tree species along their xeric boundaries in Central Europe, there has been no detailed regional analysis before. Experimental studies and field survey data suggest a strong decline in beech regeneration (Czajkowski et al. 2005; Penuelas et al. 2007; Lenoir et al. 2009) and increased mortality rates following prolonged droughts (Berki et al. 2009). Mass mortality and range retraction are potential consequences, which have been already sporadically observed in field survey studies (Jump et al. 2009; Allen et al. 2010; Mátyás et al. 2009). With the study, we intend to assist in assessing overall risks, locating potentially affected regions and supporting the formulation of appropriate measures and strategies."
[6] ""                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                            
[7] "Beech and sessile oak forests of Hungary are to a large extent “trailing edge” populations (Hampe and Petit 2005), which should be preferably modelled using specific modelling strategies (Thuiller et al. 2008). Most modelling studies do not differentiate between leading and trailing edges and rely on assumptions and techniques which are intrinsically more appropriate for “leading edge” situations. Being aware of these challenges, we compiled a statistical methodology customized to yield inference on influential variables and providing robust and reliable predictions for climate-dependent populations near their xeric limits. We laid special emphasis on three features in the course of the modelling process: (1) screening of the occurrence data in order to limit modelling to plausible zonal (i.e. macroclimatically determined) occurrences, (2) avoiding pitfalls of statistical pseudoreplication caused by spatial autocorrelation (a problem to which regional distribution modelling studies are particularly prone; Dormann 2007) and (3) simultaneous use of several initial and boundary conditions in an ensemble modelling framework (Araújo et al. 2005; Araújo and New 2007; Beaumont et al. 2007). "                                                                                                                                                                                                                         

# retrieve text inbetween parantheses:
extr1 <- unlist(str_extract_all(txt, pattern = "\\(.*?\\)"))

# keep only those elements which have four digit strings (years):
extr2 <- extr1[grep("[0-9]{4}", extr1)]

# extract partial strings starting with uppercase letter (name)
# and end in a four digit string (year):
(str_extract(extr2, "[A-Z].*[0-9]"))
 [1] "Christensen et al. 2007"                                                              
 [2] "Fischlin et al. 2007"                                                                 
 [3] "Sykes et al. 1996; Iverson, Prasad 2001; Rehfeldt et al. 2003; Ohlemüller et al. 2006"
 [4] "Benito Garzón et al. 2008"                                                            
 [5] "Bolliger et al. 2000"                                                                 
 [6] "Berry et al. 2002"                                                                    
 [7] "Benito Garzón et al. 2008"                                                            
 [8] "Mátyás et al. 2009"                                                                   
 [9] "Czajkowski et al. 2005; Penuelas et al. 2007; Lenoir et al. 2009"                     
[10] "Berki et al. 2009"                                                                    
[11] "Jump et al. 2009; Allen et al. 2010; Mátyás et al. 2009"                              
[12] "Hampe and Petit 2005"                                                                 
[13] "Thuiller et al. 2008"                                                                 
[14] "Dormann 2007"                                                                         
[15] "Araújo et al. 2005; Araújo and New 2007; Beaumont et al. 2007"                        

# as proposed by a commentator -
# do this if you want each citation seperately:
(unlist(str_extract_all(extr2, "[A-Z].*?[0-9]{4}")))
 [1] "Christensen et al. 2007"   "Fischlin et al. 2007"     
 [3] "Sykes et al. 1996"         "Iverson, Prasad 2001"     
 [5] "Rehfeldt et al. 2003"      "Ohlemüller et al. 2006"   
 [7] "Benito Garzón et al. 2008" "Bolliger et al. 2000"     
 [9] "Berry et al. 2002"         "Benito Garzón et al. 2008"
[11] "Mátyás et al. 2009"        "Czajkowski et al. 2005"   
[13] "Penuelas et al. 2007"      "Lenoir et al. 2009"       
[15] "Berki et al. 2009"         "Jump et al. 2009"         
[17] "Allen et al. 2010"         "Mátyás et al. 2009"       
[19] "Hampe and Petit 2005"      "Thuiller et al. 2008"     
[21] "Dormann 2007"              "Araújo et al. 2005"       
[23] "Araújo and New 2007"       "Beaumont et al. 2007"







Labels:

Citation


                                              ,
                                            

R


                                              ,
                                            

Regex


                                              ,
                                            

stringr













2
Kommentare











                                          
                                        

                                        
                                      


25 Mar 2012



                                        
                                      





Classification Trees and Spatial Autocorrelation






I'm currently trying to model species presence / absence data (N = 523) that were collected over a geographic area and are possibly spatially autocorrelated. Samples come from preferential sites (sea level > 1200 m, obligatory presence of permanent waterbodies, etc). My main goal is to infere on environmental factors determining the occurrence rate of several (amphibian) species and to rule out spatial autocorrelation.











Read more »





Labels:

Classification Trees


                                              ,
                                            

ctree()


                                              ,
                                            

R


                                              ,
                                            

Spatial Autocorrelation













0
Kommentare











                                          
                                        

                                        
                                      


24 Mar 2012



                                        
                                      





Custom Summary Stats as Dataframe or List






On Stackoverflow I found this useful example on how to apply custom statistics on a dataframe and return the results as list or dataframe:







Read more »





Labels:

by()


                                              ,
                                            

ddply()


                                              ,
                                            

R













0
Kommentare











                                          
                                        

                                        
                                      


14 Mar 2012



                                        
                                      





Creating a Stratified Random Sample of a Dataframe






Expanding on a question on Stack Overflow I'll show how to make a stratified random sample of a certain size: 

d <- expand.grid(id = 1:35000, stratum = letters[1:10])

p = 0.1

dsample <- data.frame()

system.time(
for(i in levels(d$stratum)) {
  dsub <- subset(d, d$stratum == i)
  B = ceiling(nrow(dsub) * p)
  dsub <- dsub[sample(1:nrow(dsub), B), ]
  dsample <- rbind(dsample, dsub) 
  }
)

# size per stratum in resulting df is 10 % of original size:
table(dsample$stratum)






Labels:

R


                                              ,
                                            

Stratified Random Sampling













2
Kommentare











                                          
                                        

                                        
                                      


13 Mar 2012



                                        
                                      





R-Function to Read Data from Google Docs Spreadsheets










I used this idea posted on Stack Overflow to plug together a function for reading data from Google Docs spreadsheets into R. 









Read more »





Labels:

Google Docs


                                              ,
                                            

Google Spreadsheets


                                              ,
                                            

R













6
Kommentare











                                          
                                        

                                        
                                      


28 Feb 2012



                                        
                                      





Apprentice Piece with Lattice Graphs






Lattice graphs can be quite tedious to learn. I don't use them too often and  when I need them I usually have to dig deep into the archives for details on the parameter details.

The here presented example may serve as a welcome template for the usage of panel functions, panel ordering, for drawing of lattice keys, etc. 

You can download the example data HERE.



(Also, check this resource with examples by the lattice-author).  







Read more »





Labels:

Lattice graphs


                                              ,
                                            

panel function


                                              ,
                                            

R


                                              ,
                                            

R Graphs













0
Kommentare











                                          
                                        

                                        
                                      


27 Feb 2012



                                        
                                      





Testing the Effect of a Factor within each Level of another Factor with R-Package {contrast}








This is a small example of how custom contrasts can easily be applied with the contrast-package. The package-manual has several useful explanations and the below example was actually grabbed from there.

This example can also be applied to a GLM but I choose to use a LM because the coefficients are more easily interpreted.











Read more »





Labels:

Contrasts


                                              ,
                                            

R













0
Kommentare











                                          
                                        

                                        
                                      


1 Feb 2012



                                        
                                      





Transformation of Several Variables in a Dataframe






This is how I transform several columns of a dataframe, i.e., with count-data into binary coded data (this would apply also for any other conversion..).



count1 <- count2 <- count3 <- count4 <- sample(c(rep(0, 10), 1:10))
some <- LETTERS[1:20]  
thing <- letters[1:20]  
mydf <- data.frame(count1, count2, count3, count4, some, thing)

ids <- grep("count", names(mydf))
myfun <- function(x) {ifelse(x > 0, 1, 0)}
mydf[, ids] <- lapply(mydf[, ids], myfun)



p.s.: Let me know if you know of a slicker way.






Labels:

R













4
Kommentare











                                          
                                        

                                        
                                      


26 Jan 2012



                                        
                                      





A Short Example with R-Package osmar..












Following up my last post in which I praised the capabilities of the osmar-package I give a short example...



ps: You can also find this example at GitHub HERE.



















Read more »





Labels:

Maps


                                              ,
                                            

OpenStreetMaps


                                              ,
                                            

osmar


                                              ,
                                            

R













3
Kommentare















osmar - Don't Miss this New R-Geo-Package!








The osmar-package enables you to retrieve all geographic elements of OpenStreetMap via its API.

I.e., you can retrieve a street, river, state-boundary or whatever and use this as a spatial object in R.



It's overwhelming thinking of the endless playground that is opened for R-users by this package!



And, owing to altruistic R-package authors (like the ones of osmar, Thomas Schlesinger and Manuel J. A. Eugster) the oligarch's (ESRI) power evermore crumbles away..





Labels:

OpenStreetMap


                                              ,
                                            

osmar


                                              ,
                                            

R














                                                  1 
                                                  Kommentare











                                          
                                        

                                        
                                      


1 Jan 2012



                                        
                                      





R-Function to Source all Functions from a GitHub Repository








Here's a function that sources all scripts from an arbitrary github-repository. At the moment the function downloads the whole repo and sources functions kept in a folder named "Functions" - this may be adapted for everyones own purpose.









Read more »





Labels:

GitHub


                                              ,
                                            

R













0
Kommentare

Blogs	NoHits
http://google-opensource.blogspot.com	82300
http://www.programmingr.com	73500
http://googleresearch.blogspot.com	58000
http://dirk.eddelbuettel.com	53000
http://borasky-research.net	33100
http://casoilresource.lawr.ucdavis.edu	32500
http://andrewgelman.com	30000
http://yihui.name	29600
http://xianblog.wordpress.com	27900
http://nsaunders.wordpress.com	27600
http://chem-bla-ics.blogspot.com	26600
http://plindenbaum.blogspot.com	24600
http://blog.ouseful.info	24300
http://www.vcasmo.com	24200
http://yz.mit.edu	23500
http://romainfrancois.blog.free.fr	22700
http://blog.revolutionanalytics.com	21000
http://robjhyndman.com	18400
http://freakonometrics.blog.free.fr	16100
http://perfdynamics.blogspot.com	15400
http://www.stubbornmule.net	14800
http://zoonek.free.fr	14800
http://jackman.stanford.edu	13900
http://www.bytemining.com	13700
http://learnr.wordpress.com	12600
http://tommy.chheng.com	12200
http://mazamascience.com	12000
http://www.investuotojas.eu	11500
http://www.r-statistics.com	11300
http://www.franklincenterhq.org	10800
http://gettinggeneticsdone.blogspot.com	10700
http://mpastell.com	9930
http://pineda-krch.com	9780
http://blog.saush.com	9220
http://www.premiersoccerstats.com	8950
http://developmentality.wordpress.com	7250
http://www.dataspora.com	7200
http://blog.hiremebecauseimsmart.com	7050
http://isomorphismes.tumblr.com	7040
http://www.mathfinance.cn	6930
http://blog.nguyenvq.com	6150
http://www.drewconway.com	5970
http://www.carlboettiger.info	5520
http://www.statisticsblog.com	5110
http://www.decisionsciencenews.com	4950
http://www.r-chart.com	4810
http://chartsgraphs.wordpress.com	4480
http://www.portfolioprobe.com	4410
http://procomun.wordpress.com	4330
http://jeromyanglim.blogspot.com	4080
http://spatialanalysis.co.uk	4080
http://www.theresearchkitchen.com	4080
http://www.forex-bloggers.com	4070
https://www.rmetrics.org	4050
http://princeofslides.blogspot.com	3900
http://www.cybaea.net	3740
http://www.cerebralmastication.com	3710
http://ygc.name	3670
http://ryouready.wordpress.com	3450
http://jeffreybreen.wordpress.com	3410
http://systematicinvestor.wordpress.com	3400
http://sgsong.blogspot.com	3310
http://industrialengineertools.blogspot.com	3290
http://www.r-tutor.com	3270
http://fishlab.ucdavis.edu	3270
http://ggorjan.blogspot.com	3250
http://blog.ynada.com	3220
http://farmacokratia.blogspot.com	3170
http://4dpiecharts.com	3130
http://heuristically.wordpress.com	3040
http://blog.rtwilson.com	2910
http://www.wekaleamstudios.co.uk	2890
http://www.dataists.com	2840
http://ikanb.wordpress.com	2750
http://shape-of-code.coding-guidelines.com	2730
http://onertipaday.blogspot.com	2710
http://blog.fosstrading.com	2700
http://blog.echen.me	2690
http://www.theusrus.de	2670
http://cloudnumbers.com	2630
http://paulbutler.org	2620
http://biostatmatt.com	2460
http://www.johnmyleswhite.com	2430
http://dataninja.wordpress.com	2360
http://realizationsinbiostatistics.blogspot.com	2340
http://statisfaction.wordpress.com	2300
http://uxblog.idvsolutions.com	2250
http://timelyportfolio.blogspot.com	2210
http://radfordneal.wordpress.com	2200
http://sas-and-r.blogspot.com	2200
http://pairach.com	2110
http://yusung.blogspot.com	2050
http://blog.flacso.edu.mx	2010
http://www.rensenieuwenhuis.nl	2000
http://michaeldhealy.com	1990
http://freigeist.devmag.net	1950
http://www.fernandohrosa.com.br	1920
http://statbandit.wordpress.com	1870
http://www.win-vector.com	1840
http://lukemiller.org	1830
http://ropensci.org	1720
http://www.eggwall.com	1650
http://benmazzotta.wordpress.com	1620
http://bms.zeugner.eu	1610
http://cartesianfaith.wordpress.com	1580
http://linkedscience.org	1570
http://stevemosher.wordpress.com	1550
http://intelligenttradingtech.blogspot.com	1520
http://www.imachordata.com	1480
http://blog.diegovalle.net	1470
http://jermdemo.blogspot.com	1430
http://nortalktoowise.com	1420
http://ekonometrics.blogspot.com	1340
http://digitheadslabnotebook.blogspot.com	1320
http://flyordie.sin.khk.be	1310
http://schamberlain.github.com	1230
http://gribblelab.org	1180
http://www.quantf.com	1130
http://offensivepolitics.net	1020
http://www.markmfredrickson.com	981
http://blog.mckuhn.de	948
http://erehweb.wordpress.com	889
http://confounding.net	886
http://simplystatistics.tumblr.com	875
http://www.babelgraph.org	859
http://csgillespie.wordpress.com	857
http://joewheatley.net	844
http://helmingstay.blogspot.com	843
http://theaverageinvestor.wordpress.com	825
http://quantitative-ecology.blogspot.com	785
http://zvfak.blogspot.com	776
http://ucfagls.wordpress.com	766
http://opendatagroup.com	760
http://cameron.bracken.bz	740
http://rtutorialseries.blogspot.com	738
http://opencpu.org	708
http://novicemetrics.blogspot.com	700
http://lamages.blogspot.com	680
http://nir-quimiometria.blogspot.com	679
http://tonybreyal.wordpress.com	677
http://brokeringclosure.wordpress.com	658
http://socialdatablog.com	643
http://dancingeconomist.blogspot.com	629
http://www.rtexttools.com	603
http://danganothererror.wordpress.com	589
http://thebiobucket.blogspot.com	567
http://holtmeier.de	531
http://val-systems.blogspot.com	519
http://thelogcabin.wordpress.com	489
http://dcemri.blogspot.com	484
http://rdatamining.wordpress.com	477
http://bridgewater.wordpress.com	460
http://www.rcasts.com	444
http://dsparks.wordpress.com	436
http://pr.cloudst.at	422
http://polstat.org	409
http://www.compmath.com	401
http://techno-realism.blogspot.com	399
http://www.backsidesmack.com	395
http://geotheory.org	393
http://miraisolutions.wordpress.com	367
http://econometricsense.blogspot.com	352
http://blog.binfalse.de	344
http://rforcancer.drupalgardens.com	317
http://blog.rstudio.org	316
http://mcfromnz.wordpress.com	309
http://www.quantumforest.com	309
http://blog.quanttrader.org	303
http://chrisladroue.com	293
http://www.michaelbommarito.com	289
http://procrun.com	280
http://mikeksmith.posterous.com	279
http://bio7.org	278
http://kbroman.wordpress.com	278
http://martynplummer.wordpress.com	272
http://bryer.org	268
http://www.funjackals.com	265
http://www.harlan.harris.name	252
http://www.milktrader.net	248
http://www.surefoss.org	241
http://rigorousanalytics.blogspot.com	231
http://www.jameskeirstead.ca	229
http://programming-r-pro-bro.blogspot.com	225
http://plausibel.blogspot.com	224
http://statistic-on-air.blogspot.com	217
http://mintgene.wordpress.com	212
http://moderntoolmaking.blogspot.com	205
http://quantitativeecology.blogspot.com	199
http://www.sigmafield.org	199
http://www.ancienteco.com	194
http://worldofrcraft.blogspot.com	191
http://rappster.wordpress.com	190
http://stotastic.com	189
http://evolvingspaces.blogspot.com	184
http://strugglingthroughproblems.blogspot.com	166
http://sharpstatistics.co.uk	161
http://leftcensored.skepsi.net	160
http://omegahat.wordpress.com	156
http://drunks-and-lampposts.com	155
http://amathew.com	152
http://onlinelabor.blogspot.com	147
http://johnramey.net	144
http://gossetsstudent.wordpress.com	138
http://tomhopper.wordpress.com	135
http://ggobi.blogspot.com	134
http://blog.fellstat.com	131
http://www.openanalytics.eu	130
http://www.numbertheory.nl	127
http://stats.blogoverflow.com	127
http://the-praise-of-insects.blogspot.com	122
http://lpenz.github.com	118
http://christophergandrud.blogspot.com	118
http://f.giorlando.org	112
http://bayesianbiologist.com	110
http://www.graphoftheweek.org	109
http://oneliner.soma20.com	109
http://inundata.org	107
http://geokook.wordpress.com	104
http://blog.datapunks.com	102
http://eranraviv.com	102
http://eranraviv.com	102
http://www.compbiome.com	101
http://www.techpolicy.ca	99
http://www.psychwire.co.uk	97
http://blog.carlislerainey.com	93
http://vasishth-statistics.blogspot.com	93
http://www.statsravingmad.com	93
http://using-r-project.blogspot.com	93
http://www.nikhilgopal.com	92
http://thedatamonkey.blogspot.com	92
http://jeffreyhorner.tumblr.com	90
http://menugget.blogspot.com	88
http://www.twotorials.com	88
http://dataexcursions.wordpress.com	84
http://viksalgorithms.blogspot.com	83
http://exploringdatablog.blogspot.com	81
http://sachaepskamp.com	81
http://aphysicistinwallstreet.blogspot.com	77
http://lastresortsoftware.blogspot.com	75
http://www.nomad.priv.at	72
http://applyr.blogspot.com	71
http://www.knowledgediscovery.jp	71
http://weitaiyun.blogspot.com	71
http://xmphforex.wordpress.com	71
http://statsadventure.blogspot.com	70
http://davenportspatialanalytics.squarespace.com	70
http://anandram.wordpress.com	69
http://rpint.wordpress.com	68
http://datadebrief.blogspot.com	66
http://blog.cloudstat.org	64
http://www.r-podcast.org	64
http://rmkrug.wordpress.com	62
http://denishaine.wordpress.com	61
http://expansed.com	58
http://r.andrewredd.us	57
http://isseing333.blogspot.com	57
http://solomonmessing.wordpress.com	57
http://rtricks.wordpress.com	57
http://anrprogrammer.wordpress.com	56
http://arungaikwad.wordpress.com	56
http://geolabs.wordpress.com	55
http://lookingatdata.blogspot.com	55
http://factbased.blogspot.com	54
http://severity.blogspot.com	54
http://swordofcrom.wordpress.com	53
http://librestats.wordpress.com	51
http://marcinkula.wordpress.com	51
http://gsoc2010r.wordpress.com	47
http://psyccomputing.blogspot.com	46
http://fabiomarroni.wordpress.com	45
http://jedifran.com	45
http://alstatr.blogspot.com	43
http://r-video-tutorial.blogspot.com	42
http://alexfarquhar.posterous.com	40
http://bmb-common.blogspot.com	40
http://rdataviz.wordpress.com	40
http://mypapertrades.blogspot.com	38
http://pitchrx.blogspot.com	38
http://simonmueller.net	38
http://statisfactions.wordpress.com	37
http://nzprimarysectortrade.wordpress.com	36
http://seanmulcahy.blogspot.com	36
http://www.speakingstatistically.com	35
http://joshpaulson.wordpress.com	34
http://learningrbasic.blogspot.com	34
http://mockquant.blogspot.com	33
http://costaleconomist.blogspot.com	32
http://rsnippets.blogspot.com	31
http://statmethods.wordpress.com	29
http://aviadklein.wordpress.com	28
http://obeautifulcode.com	28
http://blog.cloudst.at	24
http://rstats.posterous.com	23
http://notebookonthewebs.tumblr.com	22
http://0utlier.blogspot.com	21
http://gjkerns.github.com	21
http://eigensomething.blogspot.com	10
http://brocktibert.wordpress.com	9
http://toddjobe.blogspot.com	9
http://mickeymousemodels.blogspot.com	9
http://forgetfulfunctor.blogspot.com	9
http://rocknrblog.wordpress.com	9
http://dmbates.blogspot.com	8
http://blog.nextbiomotif.com	8
http://indiacrunchin.wordpress.com	8
http://blog.trenthauck.com	8
http://mikescnc.blogspot.com	8
http://jeroldhaas.blogspot.com	8
http://tlevine.tumblr.com	8
http://empty-moon-9726.heroku.com	8
http://www.proc-x.com	7
http://jointposterior.blogspot.com	7
http://gastonsanchez.wordpress.com	7
http://mlt-thinks.blogspot.com	7
http://rstats.wordpress.com	7
http://playingwithr.blogspot.com	7
http://scottmutchler.blogspot.com	6
http://iamdata.wordpress.com	6
http://sfchaos.blogspot.com	6
http://nightlordtw.wordpress.com	5
http://pleasepasstheroc.blogspot.com	5
http://wiekvoet.blogspot.com	5
http://d7.stattler.com	4
http://yetanotherrblog.blogspot.com	4
http://blog.iwanluijks.nl:80	3
https://rlearner.wordpress.com	3
http://margintale.blogspot.com	1





Newer Posts




Older Posts



Home





Subscribe to:

Posts
                                      (
                                      Atom
                                      )













Search Blog









Selected Labels




Animation


                    (
                    3
                    )
                  



GBIF


                    (
                    2
                    )
                  



GScholarScraper()


                    (
                    4
                    )
                  



Google Docs


                    (
                    7
                    )
                  



Google Maps


                    (
                    4
                    )
                  



Google Scholar


                    (
                    6
                    )
                  



HTML


                    (
                    4
                    )
                  



Invasive Species


                    (
                    3
                    )
                  



Lattice graphs


                    (
                    5
                    )
                  



MS-Word


                    (
                    5
                    )
                  



Maps


                    (
                    9
                    )
                  



Myricaria germanica


                    (
                    3
                    )
                  



Ordination diagram


                    (
                    3
                    )
                  



PERMANOVA


                    (
                    3
                    )
                  



R


                    (
                    99
                    )
                  



R Graphs


                    (
                    16
                    )
                  



Regex


                    (
                    4
                    )
                  



Species Distribution


                    (
                    3
                    )
                  



String-Manipulation


                    (
                    4
                    )
                  



VBA


                    (
                    10
                    )
                  



Vegan Package


                    (
                    5
                    )
                  



Web Scraping


                    (
                    5
                    )
                  



Word-Cloud


                    (
                    5
                    )
                  



adonis()


                    (
                    3
                    )
                  



as.raster()


                    (
                    2
                    )
                  



jpeg


                    (
                    4
                    )
                  



paired samples


                    (
                    2
                    )
                  



panel function


                    (
                    2
                    )
                  



permutation test


                    (
                    2
                    )
                  



rasterImage()


                    (
                    2
                    )
                  



readLines()


                    (
                    4
                    )
                  



saveHTML


                    (
                    2
                    )
                  



xyplot()


                    (
                    1
                    )
                  





Blog Archive









                  ► 
                



2025


                (
                1
                )
              





                  ► 
                



March


                (
                1
                )
              









                  ► 
                



2024


                (
                2
                )
              





                  ► 
                



September


                (
                1
                )
              







                  ► 
                



June


                (
                1
                )
              









                  ► 
                



2023


                (
                6
                )
              





                  ► 
                



November


                (
                1
                )
              







                  ► 
                



October


                (
                1
                )
              







                  ► 
                



August


                (
                1
                )
              







                  ► 
                



April


                (
                1
                )
              







                  ► 
                



March


                (
                1
                )
              







                  ► 
                



February


                (
                1
                )
              









                  ► 
                



2022


                (
                5
                )
              





                  ► 
                



December


                (
                1
                )
              







                  ► 
                



September


                (
                1
                )
              







                  ► 
                



July


                (
                1
                )
              







                  ► 
                



June


                (
                1
                )
              







                  ► 
                



February


                (
                1
                )
              









                  ► 
                



2020


                (
                1
                )
              





                  ► 
                



November


                (
                1
                )
              









                  ► 
                



2019


                (
                3
                )
              





                  ► 
                



August


                (
                2
                )
              







                  ► 
                



February


                (
                1
                )
              









                  ► 
                



2017


                (
                1
                )
              





                  ► 
                



July


                (
                1
                )
              









                  ► 
                



2016


                (
                5
                )
              





                  ► 
                



December


                (
                1
                )
              







                  ► 
                



October


                (
                1
                )
              







                  ► 
                



July


                (
                1
                )
              







                  ► 
                



February


                (
                1
                )
              







                  ► 
                



January


                (
                1
                )
              









                  ► 
                



2015


                (
                7
                )
              





                  ► 
                



October


                (
                1
                )
              







                  ► 
                



April


                (
                2
                )
              







                  ► 
                



February


                (
                3
                )
              







                  ► 
                



January


                (
                1
                )
              









                  ► 
                



2014


                (
                18
                )
              





                  ► 
                



December


                (
                6
                )
              







                  ► 
                



November


                (
                2
                )
              







                  ► 
                



September


                (
                1
                )
              







                  ► 
                



July


                (
                2
                )
              







                  ► 
                



May


                (
                2
                )
              







                  ► 
                



March


                (
                3
                )
              







                  ► 
                



February


                (
                1
                )
              







                  ► 
                



January


                (
                1
                )
              









                  ► 
                



2013


                (
                26
                )
              





                  ► 
                



September


                (
                5
                )
              







                  ► 
                



August


                (
                2
                )
              







                  ► 
                



June


                (
                4
                )
              







                  ► 
                



May


                (
                4
                )
              







                  ► 
                



April


                (
                7
                )
              







                  ► 
                



March


                (
                2
                )
              







                  ► 
                



February


                (
                2
                )
              








                ▼ 
              


2012


                (
                37
                )
              




                ▼ 
              


December


                (
                2
                )
              



Convert OpenStreetMap Objects to KML with R




Taxonomy with R: Exploring the Taxize-Package










                  ► 
                



November


                (
                4
                )
              



So, What Are You? ..A Plant? ..An Animal? -- Nope,...




Add Comments in MS-Word using VBA




WIKI Search in Excel with VBA




Calculate Single Contour-Line from DEM with QGIS /...










                  ► 
                



September


                (
                3
                )
              



Merging Dataframes by Partly Matching String




Reading and Text Mining a PDF-File in R




Get Long-Term Climate Data from KNMI Climate Explorer










                  ► 
                



August


                (
                4
                )
              



Follow-Up: Making a Word Cloud for a Search Result...




Toy Example with GScholarScraper_3.1




Web-Scraper for Google Scholar Updated!




Tcl/Tk GUI Example with Variable Input by User










                  ► 
                



June


                (
                4
                )
              



Use IUCN API with R & XPath




Avoid Overplotting of Text in Ordination Diagram




A Wrapper Function for Instant Package Installatio...




FloraWeb Plant Species Report via R










                  ► 
                



May


                (
                4
                )
              



Source R-Script from Dropbox




knitr-Example: Use World Bank Data to Generate Rep...




Playing with knitr: Create Report with Dynamic List




Quick Tip: Replace Values in Dataframe on Conditio...










                  ► 
                



April


                (
                4
                )
              



Reproducible Research: Running odfWeave with 7-zip




Reproducible Research: Export Regression Table to ...




R-Bloggers' Web-Presence




A Little Web Scraping Exercise with XML-Package










                  ► 
                



March


                (
                6
                )
              



Applying Same Changes  to Multiple Dataframes




How to Extract Citation from a Body of Text




Classification Trees and Spatial Autocorrelation




Custom Summary Stats as Dataframe or List




Creating a Stratified Random Sample of a Dataframe




R-Function to Read Data from Google Docs Spreadsheets










                  ► 
                



February


                (
                3
                )
              



Apprentice Piece with Lattice Graphs




Testing the Effect of a Factor within each Level o...




Transformation of Several Variables in a Dataframe










                  ► 
                



January


                (
                3
                )
              



A Short Example with R-Package osmar..




osmar - Don't Miss this New R-Geo-Package!




R-Function to Source all Functions from a GitHub R...












                  ► 
                



2011


                (
                57
                )
              





                  ► 
                



December


                (
                10
                )
              







                  ► 
                



November


                (
                15
                )
              







                  ► 
                



October


                (
                5
                )
              







                  ► 
                



September


                (
                6
                )
              







                  ► 
                



August


                (
                3
                )
              







                  ► 
                



June


                (
                4
                )
              







                  ► 
                



May


                (
                5
                )
              







                  ► 
                



April


                (
                9
                )
              










Science Daily Feed




Loading...






GIS Stackexchange Feed




Loading...






Stackoverflow Feed




Loading...






RBloggers Feed




Loading...













theBioBucket@github






















I do not guarantee for the correctness of any content on this website. Please concern copyright terms of second-party data sources and cite sources properly.


This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License.




Powered by Blogger.

TABS

22 Dec 2012

16 Dec 2012

28 Nov 2012

20 Nov 2012

12 Nov 2012

6 Nov 2012

29 Sept 2012

28 Sept 2012

6 Sept 2012

31 Aug 2012

25 Aug 2012

22 Aug 2012

11 Aug 2012

29 Jun 2012

25 Jun 2012

19 Jun 2012

11 Jun 2012

14 May 2012

2 May 2012

1 May 2012

25 Apr 2012

20 Apr 2012

6 Apr 2012

5 Apr 2012

28 Mar 2012

26 Mar 2012

25 Mar 2012

24 Mar 2012

14 Mar 2012

13 Mar 2012

28 Feb 2012

27 Feb 2012

1 Feb 2012

26 Jan 2012

1 Jan 2012