PostgreSQL Tutorial: REGEXP_REPLACE Function

September 17, 2023

Summary: In this tutorial, you will learn how to use the PostgreSQL REGEXP_REPLACE() function to replace strings that match a regular expression.

Table of Contents

The PostgreSQL REGEXP_REPLACE() function replaces substrings that match a POSIX regular expression by a new substring.

Note that if you want to perform simple string replacement, you can use the REPLACE() function.

Syntax

The syntax of the PostgreSQL REGEXP_REPLACE() function is as follows:

REGEXP_REPLACE(source, pattern, replacement_string,[, flags])

Arguments

The REGEXP_REPLACE() function accepts four arguments:

1) source

The source is a string that replacement should be taken place.

2) pattern

The pattern is a POSIX regular expression for matching substrings that should be replaced.

3) replacement_string

The replacement_string is a string that to replace the substrings which match the regular expression pattern.

4) flags

The flags argument is one or more character that controls the matching behavior of the function e.g., i allows case-insensitive matching, n enables matching any character and also the newline character.

Return value

The PostgreSQL REGEXP_REPLACE() function returns a new string with the substrings, which match a regular expression pattern, replaced by a new substring.

Examples

Let’s see some examples to understand how the REGEXP_REPLACE() function works.

A) Name rearrangement

Suppose, you have a name of a person in the following format:

first_name last_name

For example, John Doe

And you want to rearrange the this name as follows for the reporting purpose.

last_name, first_name

To do this , you can use the REGEXP_REPLACE() function as shown below:

SELECT REGEXP_REPLACE('John Doe','(.*) (.*)','\2, \1');

The output of the statement is:

'Doe, John'

B) String removal

Imagine you have string data with mixed alphabets and digits as follows:

ABC12345xyz

The following statement removes all alphabets e.g., A, B, C, etc from the source string:

SELECT REGEXP_REPLACE('ABC12345xyz','[[:alpha:]]','','g');

The output is:

'12345'

In this example,

  • [[:alpha:]] matches any alphabets
  • '' is the replacement string
  • 'g' instructs the function to remove all alphabets, not just the first one.

Similarly, you can remove all digits in the source string by using the following statement:

SELECT REGEXP_REPLACE('ABC12345xyz','[[:digit:]]','','g');

And the output is:

'ABCxyz'

C) Redundant spaces removal

The following example removes unwanted spaces that appear more than once in a string.

SELECT REGEXP_REPLACE('This  is    a   test   string','( ){2,}',' ');

The following picture illustrates the output:

PostgreSQL REGEXP_REPLACE Example

In this tutorial, you have learned how to use the PostgreSQL REGEXP_REPLACE() function to replace substrings that match a regular expression with a new substring.

See more

PostgreSQL Tutorial: String Functions

PostgreSQL Documentation: String Functions and Operators