When can you encode email addresses using RFC 2047?

Summary

RFC 2047 encoding is a technique for representing non-ASCII characters within specific sections of email headers. The consensus across experts, marketers, and documentation is that RFC 2047 encoding should primarily be used for the display name (also known as the friendly name or phrase) associated with an email address in the From, To, and Cc headers, as well as in the Subject and Comments headers. The actual email address (the 'addr-spec') must remain in standard US-ASCII format. Encoding the email address itself is discouraged and can be seen as a security risk. Furthermore, it is invalid to use RFC 2047 encoding in List-Unsubscribe headers.

Key findings

  • Display Name Focus: RFC 2047 is primarily intended for encoding the display name (or friendly name) in email headers.
  • Address Restriction: The actual email address (addr-spec) must remain in US-ASCII format.
  • Security Concerns: Encoding the entire email address poses a security risk and may allow malicious addresses to be hidden.
  • Valid Header Sections: RFC 2047 encoding is valid in the Subject, Comments, and the display name portions of From, To, and Cc headers.
  • Invalid Header Sections: RFC 2047 encoding is invalid in List-Unsubscribe headers.

Key considerations

  • Character Encoding: When using special or international characters, ensure only the display name is encoded, not the email address itself.
  • Header Compliance: Follow RFC 2047 specifications when encoding header fields to ensure proper rendering across various email clients.
  • Security Awareness: Be mindful of the security implications of encoding and avoid practices that could mask malicious addresses.
  • List-Unsubscribe Avoidance: Do not use RFC 2047 encoding in List-Unsubscribe headers as it is not valid and may cause issues.
  • Documentation Reference: Refer to RFC 2047 Section 5 for details on what can be replaced by an encoded word in email headers.

What email marketers say
10Marketer opinions

RFC 2047 encoding is used to represent non-ASCII characters in email headers, specifically within the display name (or friendly name) portion of email addresses (e.g., in the From, To, and Cc fields). The actual email address itself must remain in standard US-ASCII format. Encoding the email address is often seen as a security risk because it can be used to hide malicious addresses.

Key opinions

  • Display Name Encoding: RFC 2047 is primarily for encoding the display name or friendly name associated with an email address.
  • Address Restriction: The actual email address (the part before and after the @ symbol) must remain in US-ASCII.
  • Security Risk: Encoding the entire email address can create a security vulnerability by allowing malicious actors to disguise harmful addresses.

Key considerations

  • Special Characters: When using special or international characters, ensure you are only encoding the display name and not the email address itself.
  • Header Encoding: Use RFC 2047 methods to properly encode the header fields, especially the display name, to ensure they are displayed correctly across different email clients.
  • Security Implications: Be aware of the security implications of encoding email addresses and avoid any practice that could be used to mask malicious addresses.
Marketer view

Email marketer from Email Marketing Forum responds that if you want to encode a name with special characters in the from field, use RFC 2047 encoding for the name portion, leaving the email address itself in ASCII.

December 2021 - Email Marketing Forum
Marketer view

Email marketer from Campaign Monitor shares that encode the header fields, particularly the display name, according to RFC 2047 to ensure international characters display correctly in different email clients. However, the actual email address component should remain in standard ASCII.

March 2024 - Campaign Monitor
Marketer view

Email marketer from Reddit shares that RFC 2047 encoding is used for the display name/friendly name portion of the from and to headers, allowing for special characters. The actual email address cannot be encoded this way.

March 2025 - Reddit
Marketer view

Email marketer from SuperOffice shares that if you need to send an email with international characters, focus on encoding the display name correctly using RFC 2047. The actual email address part must remain as standard ASCII.

February 2025 - SuperOffice
Marketer view

Email marketer from GMass supports that you can use special characters in the Display From Name, but the email address itself should be a valid ASCII email address.

January 2022 - GMass
Marketer view

Email marketer from Email Marketing Tips Blog states that RFC 2047 encoding is primarily for encoding the 'display name' part of an email address, not the actual email address itself. The address part must remain in US-ASCII.

April 2024 - Email Marketing Tips Blog
Marketer view

Email marketer from Litmus responds that it's crucial to encode the appropriate parts of the email header when dealing with special characters. Specifically, the display name in the From and To fields, should be encoded using RFC 2047 methods to ensure proper rendering across email clients; the actual email address must remain in ASCII.

January 2025 - Litmus
Marketer view

Email marketer from Email Geeks shares that encoded email addresses are pretty widely seen as a security risk because it lets you hide malicious addresses that look like legitimate ones.

April 2023 - Email Geeks
Marketer view

Email marketer from Mailjet responds that when using special characters, encode the header (especially the display name), using RFC 2047 methods for handling non-ASCII characters. The email address itself must remain in ASCII format.

October 2021 - Mailjet
Marketer view

Email marketer from Stack Overflow responds that RFC 2047 encoding should only be applied to the display name (the text part) of the email address, not the actual address itself. They provide an example of how the encoded display name should look.

May 2024 - Stack Overflow

What the experts say
3Expert opinions

RFC 2047 encoding is restricted to specific parts of email headers. It is valid for human-readable text, such as the Subject line, comments, and the display name portion of From, To, and Cc headers. Encoding the actual email address itself is invalid. Additionally, RFC 2047 encoding is not allowed in List-Unsubscribe headers.

Key opinions

  • Limited Encoding Scope: RFC 2047 encoding is limited to specific areas within email headers.
  • Valid Uses: Valid uses include the Subject line, comments, and display names in From, To, and Cc headers.
  • Invalid Uses: Encoding the email address itself and using RFC 2047 in List-Unsubscribe headers are not permitted.

Key considerations

  • Header Sections: Pay close attention to which sections of the email header you're encoding to ensure compliance with RFC 2047.
  • Human Readability: Only encode human-readable parts; machine-readable parts like the email address should remain in their original format.
  • List-Unsubscribe Restrictions: Avoid using RFC 2047 encoding within List-Unsubscribe headers, as this is considered invalid.
Expert view

Expert from Word to the Wise mentions that RFC 2047 encoding in List-Unsubscribe headers is invalid. RFC 2047 encoding is allowed in the display name part of the From, To, and CC headers, and in the Subject and Comments headers.

September 2023 - Word to the Wise
Expert view

Expert from Email Geeks shares RFC 2047 #5, specifying what may be replaced by an encoded word, which is a short list of places you’re allowed to do that - Subject, Comments headers, any parenthesised comment or the phrase preceding an address in From, To, Cc.

July 2024 - Email Geeks
Expert view

Expert from Email Geeks explains that you can only encode the human readable text with RFC 2047, such as the Subject line, and friendly comments in From: and To: headers, but you can’t encode the actual email address. Although it might sometimes work, it’s never valid.

December 2024 - Email Geeks

What the documentation says
4Technical articles

RFC 2047 defines how to represent non-US-ASCII characters in email header fields. This encoding is applied to specific parts of the message header, including 'Subject', 'Comments', and the display name (or phrase portion) within address fields like 'From', 'To', and 'Cc'. The actual email address ('addr-spec') itself must remain in US-ASCII characters. RFC 5322 reinforces that the email address must conform to ASCII syntax, while allowing the display name to be encoded.

Key findings

  • Encoding Purpose: RFC 2047 is used to represent non-ASCII characters in email headers.
  • Header Field Scope: It applies to specific header fields such as 'Subject', 'Comments', and the display name within address fields.
  • Address Limitation: The actual email address ('addr-spec') must remain in US-ASCII.

Key considerations

  • Character Set Support: Use RFC 2047 when needing to display international characters in email headers.
  • Field Selection: Ensure you are only applying the encoding to the appropriate header fields and portions within those fields (e.g., the display name).
  • Address Compliance: Always keep the actual email address in standard ASCII format.
Technical article

Documentation from Microsoft explains that to display international characters in email headers, you must use MIME encoding (RFC 2047). This encoding is applied to header fields like Subject, From, To, etc. but is primarily intended for the descriptive text, not the email address itself.

October 2022 - Microsoft Documentation
Technical article

Documentation from Word to the Wise explains that RFC 2047 allows encoding of the text part of headers such as Subject, Comments, and certain parts of address fields (From, To, CC), such as the display name, but not the email address itself. It refers to Section 5 of RFC 2047.

September 2024 - Word to the Wise
Technical article

Documentation from RFC 5322 (which obsoletes RFC 2822) specifies the Internet Message Format. Within address fields (like From, To), the 'display-name' can be encoded using RFC 2047 to support non-ASCII characters, but the 'addr-spec' (the actual email address) must conform to the ABNF syntax which primarily uses ASCII characters.

December 2023 - RFC 5322
Technical article

Documentation from RFC 2047 specifies that an 'encoded-word' is a technique to represent character data in non-US-ASCII character sets within message header fields. It can be used in specific parts of the message header, such as 'Subject', 'Comments', or in structured header fields like 'From', 'To', and 'Cc', within the comment and phrase portions.

September 2022 - RFC 2047