pyspark.sql.functions.md5

pyspark.sql.functions.md5(col)

Calculates the MD5 digest of each value in the column and returns it as a 32-character hex string.

New in version 1.5.0.

Changed in version 3.4.0: Supports Spark Connect.

Parameters
col : Column or column name

target column to compute on.

Returns
Column

a column holding the computed MD5 digests as 32-character hex strings.

Examples

>>> import pyspark.sql.functions as sf
>>> df = spark.createDataFrame([('ABC',)], ['a'])
>>> df.select('*', sf.md5('a')).show(truncate=False)
+---+--------------------------------+
|a  |md5(a)                          |
+---+--------------------------------+
|ABC|902fbdd2b1df0c4f70b4a5d23525e932|
+---+--------------------------------+
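
As a cross-check outside Spark, the digest shown above can be reproduced with Python's standard `hashlib` module. This is a minimal sketch, not part of the PySpark API: the helper name `md5_hex` is hypothetical, and it assumes the string is hashed as its UTF-8 bytes.

```python
import hashlib


def md5_hex(value: str) -> str:
    """Return the MD5 digest of `value` as a lowercase hex string."""
    # Assumption: the input string is hashed as its UTF-8 encoded bytes.
    return hashlib.md5(value.encode("utf-8")).hexdigest()


digest = md5_hex("ABC")
print(digest)       # hex digest of the same input used in the example above
print(len(digest))  # an MD5 digest is 128 bits, i.e. 32 hex characters
```

A local check like this can help when comparing hashes produced by Spark against values computed in plain Python, provided both sides hash the same byte representation.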