Skip to main content

Mastering String Manipulation in Python: An Introduction to Strings and String Slicing

Strings


Strings in python are defined by enclosing with either single quotation marks (") or double quotation marks (")

1

2

str_hello = "Hello..."

str_hello = 'Hello...'

It does not make a difference in the way string is stored and/or used in the program based on the way it is declared (i.e., by using single quotation marks or double quotation marks). 

Multi-line strings

Above declaration is applicable when the string is of a single line. If we have a string that has multiple lines, multi-line string can be defined by enclosing with either three single quotation marks (''') and double quotation marks (""").

1

2

3

4

5

6

7

multi_string = """Hello World,

This is a multi-line string.

enclosed in double quotation marks"""

 

multi_string = '''Hello World,

This is a multi-line string.

enclosed in single quotation marks'''

Accessing part of the string by using index

String is similar to an array of characters (there is no character data type in Python) and Index starts from 0.

1

2

3

str_hello = "Hello..."

print(str_hello[0]) # H

print(str_hello[1]) # e


Similarly data can be accessed using negative index (i.e., -1 for the last position) 

1

2

3

str_hello = "Hello..."

print(str_hello[-1]) # .

print(str_hello[-4]) # o


Below are the index positions of a string from above example. 

H e l l o . . .

0 1 2 3 4 5 6 7

-8-7-6-5-4-3-2-1

Looping through the string

We mentioned that the string is like an array of characters and we can access the specific character using it's index. 

So, how to loop through all the characters of a string? There are two ways to achieve this.
  • Retrieve string length and loop through the range and access character by using index.
  • Loop through the string directly. 
So, How to calculate the length of the string? 'len()' function returns length of a string. 

1

2

3

string = "Hello World"

for i in range(len(string)):

   print(string[i])


In this example, for loop repeats for every character in the string and character can be accessed by the variable specified (char in this example) through out the loop.

1

2

3

string = "Hello World"

for char in string:

   print(char)


What is string slicing? 

By using the index we can only access one character at a time. We can use slicing to retrieve portion of the string. 

For slicing, we need to provide the starting index (included) and ending index (not included in result). 

1

2

3

4

5

6

7

8

'''

Hello World

012345678910

'''

 

string = "Hello World"

print(string[0:5]) # Hello

print(string[2:5]) # llo


In the above example (Line - 7), we are passing index '0' (as the starting position) and index '5' (as the ending position, not included) which returns the part of the string 'Hello'. Value in the index 5 (' ') is not included in the result. 

In Line - 8, we are passing index '2' and '5', which would return the portion of the string from 2 to 4 (both included).

In the cases like Line - 7 where we are retrieving the porting of the string from beginning till the specified index, we won't really need to include index '0'. 

1

2

string = "Hello World"

print(string[:5]) # Hello


Leaving the start index would consider the string from the beginning. 

Similarly if we leave off end index, it would return the string from start index till end. 

1

2

string = "Hello World"

print(string[6:]) # World


So, what happens if we don't mention both start and end index? It returns the full string. 

1

2

string = "Hello World"

print(string[:]) # Hello World


Check if a character/string present in the string

We have seen how to access a character or a portion of string from a string. Now, Can we check if a character or a string is part of the string or not? Yes, we can do this by using 'in' and 'not in'.

1

2

3

string = "Hello World"

print('H' in string) # True

print('H' not in string) # False


In the above example,
  • Line - 2: We are checking if 'H' is in the string, which returns True if present and False if not present. 
  • Line - 3: We are checking if 'H' is not in the string, which returns False if present and True if not present. 
Similarly, we can use a string to check if it is part of another string. 

1

2

3

string = "Hello World"

print('Hello' in string) # True

print('Hello' not in string) # False


It is highly unlikely we use 'in' and 'not in' operator the way we have described (this is only for better understanding). 

Most likely we would be using this in a condition. 

1

2

3

4

5

string = "Hello World"

if 'Hello' in string:

   print('Hello is present in the string')

else:

   print('Hello is not present in the string')


1

2

3

4

5

string = "Hello World"

if 'Hello' not in string:

   print('Hello is not present in the string')

else:

   print('Hello is present in the string')



We have seen how to define a string & multi-line string, accessing a portion of the string (string slicing) and checking if a character or string is part of the string or not. Hope this has been a bit of help in understanding the use of strings in Python.


If you have any Suggestions or Feedback, Please leave a comment below or use Contact Form.

Comments

Popular posts from this blog

All about READ in RPGLE & Why we use it with SETLL/SETGT?

READ READ is one of the most used Opcodes in RPGLE. As the name suggests main purpose of this Opcode is to read a record from Database file. What are the different READ Opcodes? To list, Below are the five Opcodes.  READ - Read a Record READC - Read Next Changed Record READE - Read Equal Key Record READP - Read Prior Record READPE - Read Prior Equal Record We will see more about each of these later in this article. Before that, We will see a bit about SETLL/SETGT .  SETLL (Set Lower Limit) SETLL accepts Key Fields or Relative Record Number (RRN) as Search Arguments and positions the file at the Corresponding Record (or Next Record if exact match isn't found).  SETGT (Set Greater Than) SETGT accepts Key Fields or Relative Record Number (RRN) as Search Arguments and positions the file at the Next Record (Greater Than the Key value). Syntax: SETLL SEARCH-ARGUMENTS/KEYFIELDS FILENAME SETGT  SEARCH-ARGUMENTS/KEYFIELDS FILENAME One of the below can be passed as Search Arguments. Key Fiel

What we need to know about CHAIN (RPGLE) & How is it different from READ?

CHAIN READ & CHAIN, These are one of the most used (& useful) Opcodes by any RPG developer. These Opcodes are used to read a record from file. So, What's the difference between CHAIN & READ?   CHAIN operation retrieves a record based on the Key specified. It's more like Retrieving Random record from a Database file based on the Key fields.  READ operation reads the record currently pointed to from a Database file. There are multiple Opcodes that start with READ and all are used to read a record but with slight difference. We will see more about different Opcodes and How they are different from each other (and CHAIN) in another article. Few differences to note.  CHAIN requires Key fields to read a record where as READ would read the record currently pointed to (SETLL or SETGT are used to point a Record).  If there are multiple records with the same Key data, CHAIN would return the same record every time. READE can be used to read all the records with the specified Ke

Extract a portion of a Date/Time/Timestamp in RPGLE - IBM i

%SUBDT Extracting Year, Month, Day, Hour, Minutes, Seconds or Milli seconds of a given Date/Time/Timestamp is required most of the times.  This can be extracted easily by using %SUBDT. BIF name looks more similar to %SUBST which is used to extract a portion of string by passing from and two positions of the original string. Instead, We would need to pass a value (i.e., Date, Time or Timestamp ) and Unit (i.e., *YEARS, *MONTHS, *DAYS, *HOURS, *MINUTES, *SECONDS or *MSECONDS) to %SUBDT.  Valid unit should be passed for the type of the value passed. Below are the valid values for each type. Date - *DAYS, *MONTHS, *YEARS Time - *HOURS, *MINUTES, *SECONDS Timestamp - *DAYS, *MONTHS, *YEARS, *HOURS, *MINUTES, *SECONDS, *MSECONDS Syntax: %SUBDT(value : unit { : digits { : decpos} }) Value and Unit are the mandatory arguments.  Digits and Decimal positions are optional and can only be used with *SECONDS for Timestamp. We can either pass the full form for the unit or use the short form. Below i