Split & Join Strings in Python

In Python, strings are sequences of characters, and the str type provides many built-in methods for working with them.

We often need to split a string into a list of substrings at a delimiter, and to join a list of strings back into a single string.

Splitting a String

We can use the string method split() to split a string into a list of substrings. By default, this method splits the string wherever whitespace occurs.

In the example below, we call split() without passing any argument.

sample_string = "This is a  string"
sub_strings = sample_string.split()
print(sub_strings)

Below is the result. 

['This', 'is', 'a', 'string']

Note that the string has two spaces between "a" and "string", yet no empty entry appears in the list. With no argument, split() treats any run of consecutive whitespace as a single separator.
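The same behaviour applies to tabs and newlines, and whitespace at the start or end of the string is ignored as well. A small sketch of this (the names `messy` and `words` are only illustrative):

```python
# With no argument, split() treats any run of whitespace
# (spaces, tabs, newlines) as one separator and ignores
# whitespace at the start and end of the string.
messy = "  This\tis   a\nstring  "
words = messy.split()
print(words)  # ['This', 'is', 'a', 'string']
```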

We can also pass a specific delimiter when calling split().

In the example below, we call split() with a single space " " as the argument. Notice how the result differs from the earlier example.

sample_string = "This is a  string"
sub_strings = sample_string.split(" ")
print(sub_strings)

Below is the result. 

['This', 'is', 'a', '', 'string']

We can see an additional empty entry (''). This is because there are two spaces between "a" and "string".

Since a single space was passed as the delimiter, split() treats each space as a separate split point, and the empty string between the two consecutive spaces becomes its own entry.
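This holds for any explicit delimiter, including when it appears at the very start or end of the string. The sketch below uses a made-up input (`raw`) to show the empty entries, along with one possible way to filter them out afterwards:

```python
# With an explicit delimiter, every occurrence is a split point,
# so adjacent, leading, or trailing delimiters yield empty strings.
raw = ",a,,b,"
fields = raw.split(",")
print(fields)  # ['', 'a', '', 'b', '']

# One way to drop the empty entries afterwards:
# an empty string is falsy, so the condition skips it.
non_empty = [field for field in fields if field]
print(non_empty)  # ['a', 'b']
```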

Let's look at another example, this time passing a comma (,) as the delimiter.

sample_string = "This,is,a,string"
sub_strings = sample_string.split(",")
print(sub_strings)

Below is the result. 

['This', 'is', 'a', 'string']
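The delimiter does not have to be a single character. As a sketch, assuming the input uses a comma followed by a space between items (the variable names are illustrative), we can either pass the full ", " delimiter or split on the comma and strip each piece:

```python
spaced_string = "This, is, a, string"

# Splitting on "," alone leaves the space attached to each later item.
comma_only = spaced_string.split(",")
print(comma_only)  # ['This', ' is', ' a', ' string']

# Passing the full two-character delimiter avoids that.
comma_space = spaced_string.split(", ")
print(comma_space)  # ['This', 'is', 'a', 'string']

# Alternatively, split on "," and strip whitespace from each piece.
stripped = [part.strip() for part in spaced_string.split(",")]
print(stripped)  # ['This', 'is', 'a', 'string']
```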

There is one other useful argument we can pass to split(): maxsplit (the maximum number of splits). It specifies the most splits to perform on the string. If the string contains more delimiters than maxsplit, splitting stops after that many splits and the rest of the string is returned as the last element.

sample_string = "This,is,a,string"
sub_strings = sample_string.split(",", 2)
print(sub_strings)

Below is the result. 

['This', 'is', 'a,string']

The delimiter appears three times, so without maxsplit the string is split three times (i.e., the list contains four elements).

In this example we pass 2 as maxsplit, so the string is split at most twice: the list contains three elements, and the third delimiter is left untouched in the last element.
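A common use of maxsplit is separating a key from a value that may itself contain the delimiter. The sketch below uses a made-up "key=value" line (`line` is hypothetical) and also shows that maxsplit can be passed by keyword:

```python
# Split only at the first "=" so the value keeps its own "=" characters.
line = "title=Split=Join"
key, value = line.split("=", 1)
print(key)    # title
print(value)  # Split=Join

# maxsplit can also be given by keyword, with or without a delimiter.
first_word, rest = "This is a  string".split(maxsplit=1)
print(first_word)  # This
print(rest)        # is a  string
```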

Joining a List of Strings

Similarly, we often need to join a list of strings into a single string. For this, we can use the join() method.

This method takes a list of strings (more generally, any iterable of strings) as an argument and returns a string that is the concatenation of the elements, with the delimiter placed between each pair of adjacent elements.

Let's have a look at an example. 

substrings = ['This', 'is', 'a', 'string']
string = " ".join(substrings)
print(string)

Below is the result. 

This is a string

Unlike split(), which is called on the string with the delimiter passed as an argument, join() is called on the delimiter with the list of strings passed as an argument.
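One thing to keep in mind is that every element passed to join() must already be a string; otherwise Python raises a TypeError. A minimal sketch, using an illustrative list of integers, of how the elements can be converted first:

```python
numbers = [1, 2, 3]

# join() does not convert elements for us: non-string items raise TypeError.
try:
    ", ".join(numbers)
except TypeError as error:
    print("join() failed:", error)

# Converting each element with str() first makes the join work.
joined = ", ".join(str(number) for number in numbers)
print(joined)  # 1, 2, 3
```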

Using an empty string as the delimiter joins the strings directly: join() does not add a space or any other character by default.

substrings = ['This', 'is', 'a', 'string']
string = "".join(substrings)
print(string)

Below is the result. 

Thisisastring

We can use any other delimiter. Let's have a look at another example by using "-" as a delimiter. 

substrings = ['This', 'is', 'a', 'string']
string = "-".join(substrings)
print(string)

Below is the result.

This-is-a-string
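The two methods are often combined. As a rough sketch reusing the sample strings from the earlier examples (the result names are illustrative), splitting and then joining can normalize whitespace or swap one delimiter for another:

```python
# Collapse repeated whitespace: split on any whitespace, rejoin with one space.
sample_string = "This is a  string"
normalized = " ".join(sample_string.split())
print(normalized)  # This is a string

# Change the delimiter: split on commas, rejoin with hyphens.
comma_string = "This,is,a,string"
dashed = "-".join(comma_string.split(","))
print(dashed)  # This-is-a-string
```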

As we have seen, the split() and join() methods are useful for manipulating strings in Python. They allow you to easily split a string into a list of substrings or join a list of strings into a single string, with the required delimiter. Hope this has been a bit of help in understanding the use of strings in Python.

