Table of Contents

Reading specific rows from a CSV file into R can be achieved by using the read.csv() function in R and setting the row.names parameter to the row numbers of the desired rows. This will return a data frame containing only the rows with the given row.names. Additionally, the skip argument can be used to skip a certain number of rows before starting to read the CSV file. Finally, the nrows argument can be used to limit the number of rows that are read from the file.

You can use the following methods to read specific rows from a CSV file into R:

Method 1: Import CSV File Starting from Specific Row

df <- read.csv("my_data.csv", skip=2)

This particular example will skip the first two rows in the CSV file and import all other rows in the file starting at the third row.

Method 2: Import CSV File where Rows Meet Condition

library(sqldf)

df <- read.csv.sql("my_data.csv",
                    sql = "select * from file where `points` > 90", eol = "n")

This particular example will only import the rows in the CSV file where the value in the ‘points’ column is greater than 90.

The following examples show how to use each of these methods in practice with the following CSV file called my_data.csv:

Example 1: Import CSV File Starting from Specific Row

The following code shows how to import the CSV file and skip the first two rows in the file:

#import data frame and skip first two rows
df <- read.csv('my_data.csv', skip=2)

#view data frame
df

  B X90 X28 X28.1
1 C  86  31    24
2 D  88  39    24
3 E  95  34    28

Notice that the first two rows (with teams A and B) have been skipped when importing the CSV file.

By default, R attempts to use the values in the next available row as the column names.

To rename the columns, you can use the names() function as follows:

#rename columns
names(df) <- c('team', 'points', 'assists', 'rebounds')

#view updated data frame
df

  team points assists rebounds
1    C     86      31       24
2    D     88      39       24
3    E     95      34       28

Example 2: Import CSV File where Rows Meet Condition

We can use the read.csv.sql function from the sqldf package to do so:

library(sqldf)

#only import rows where points > 90
df <- read.csv.sql("my_data.csv",
                    sql = "select * from file where `points` > 90", eol = "n")

#view data frame
df

  team points assists rebounds
1  "A"     99      33       30
2  "E"     95      34       28

Notice that only the two rows in the CSV file where the value in the ‘points’ column is greater than 90 have been imported.

Note #1: In this example, we used the eol argument to specify that the “end of line” in the file is indicated by n, which represents a line break.

Note #2: In this example, we used a simple SQL query but you can write more complex queries to filter rows by even more conditions.

The following tutorials explain how to perform other common tasks in R:

How to Read a CSV from a URL in R

How to read specific rows from a CSV file into R?

Example 1: Import CSV File Starting from Specific Row

Example 2: Import CSV File where Rows Meet Condition

Requst a

Scale

Example 1: Import CSV File Starting from Specific Row

Example 2: Import CSV File where Rows Meet Condition

Related terms:

Requst a

Scale