What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | è¾°a}vè¾°a}vB | 11101000101111101011000001100001011111010111011011101000101111101011000001100001011111010111011001000010 | e8beb0617d76e8beb0617d7642
SJIS-WIN | ??°a}v??°a}vB | 001111110011111110000001100010110110000101111101011101100011111100111111100000011000101101100001011111010111011001000010 | 3f3f818b617d763f3f818b617d7642
EUC-JP | è?°a}vè?°a}vB | 10001111101010111011001000111111101000011110101101100001011111010111011010001111101010111011001000111111101000011110101101100001011111010111011001000010 | 8fabb23fa1eb617d768fabb23fa1eb617d7642
UTF-8 | è¾°a}vè¾°a}vB | 11000011101010001100001010111110110000101011000001100001011111010111011011000011101010001100001010111110110000101011000001100001011111010111011001000010 | c3a8c2bec2b0617d76c3a8c2bec2b0617d7642
UHC | ?¾°a}v?¾°a}vB | 0011111110101000111110101010000111000110011000010111110101110110001111111010100011111010101000011100011001100001011111010111011001000010 | 3fa8faa1c6617d763fa8faa1c6617d7642

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
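
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. The helper name encode_row and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is how the table appears to handle them; Python's codecs may differ from the tool's for some characters.

```python
def encode_row(text: str, charset: str) -> tuple[str, str]:
    """Return (binary bit string, hex string) for text encoded in charset."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(charset, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


# Assumed correspondence between the table's labels and Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

# The input string from the table, written with escapes to avoid
# depending on the encoding of this source file.
sample = "\u00e8\u00be\u00b0a}v\u00e8\u00be\u00b0a}vB"

for label, codec in CODECS.items():
    bits, hex_string = encode_row(sample, codec)
    print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.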