PYTHON
Extract Hashtags from Text in Python
A Python regex pattern to efficiently find and extract all hashtags (words prefixed with '#') from a given string, useful for social media content analysis.
import re
def extract_hashtags(text):
# Regex to find words starting with #
# It captures the word after #, excluding the # itself
hashtags = re.findall(r'#(\w+)', text)
return hashtags
# Examples:
# print(extract_hashtags("This is a #test with #multiple #hashtags and no #symbol.")) # ['test', 'multiple', 'hashtags', 'symbol']
# print(extract_hashtags("No hashtags here.")) # []
How it works: This Python snippet defines a function `extract_hashtags` that uses a regular expression (`r'#(\w+)'`) to find and extract all hashtags from a given text. The `\w+` matches one or more word characters (alphanumeric + underscore), and the parentheses create a capturing group, allowing `re.findall` to return only the word part of the hashtag, without the '#' symbol.