To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 Øü]WznfØü]Wzn^}YØü]WznfØü]Wzn^}bE 110110001111110001011101010101110111101001101110011001101101100011111100010111010101011101111010011011100101111001111101010110011101100011111100010111010101011101111010011011100110011011011000111111000101110101010111011110100110111001011110011111010110001001000101 d8fc5d577a6e66d8fc5d577a6e5e7d59d8fc5d577a6e66d8fc5d577a6e5e7d6245
SJIS-WIN ??]Wznf??]Wzn^}Y??]Wznf??]Wzn^}bE 001111110011111101011101010101110111101001101110011001100011111100111111010111010101011101111010011011100101111001111101010110010011111100111111010111010101011101111010011011100110011000111111001111110101110101010111011110100110111001011110011111010110001001000101 3f3f5d577a6e663f3f5d577a6e5e7d593f3f5d577a6e663f3f5d577a6e5e7d6245
EUC-JP Øü]WznfØü]Wzn^}YØü]WznfØü]Wzn^}bE 10001111101010011010110010001111101010111110010001011101010101110111101001101110011001101000111110101001101011001000111110101011111001000101110101010111011110100110111001011110011111010101100110001111101010011010110010001111101010111110010001011101010101110111101001101110011001101000111110101001101011001000111110101011111001000101110101010111011110100110111001011110011111010110001001000101 8fa9ac8fabe45d577a6e668fa9ac8fabe45d577a6e5e7d598fa9ac8fabe45d577a6e668fa9ac8fabe45d577a6e5e7d6245
UTF-8 Øü]WznfØü]Wzn^}YØü]WznfØü]Wzn^}bE 1100001110011000110000111011110001011101010101110111101001101110011001101100001110011000110000111011110001011101010101110111101001101110010111100111110101011001110000111001100011000011101111000101110101010111011110100110111001100110110000111001100011000011101111000101110101010111011110100110111001011110011111010110001001000101 c398c3bc5d577a6e66c398c3bc5d577a6e5e7d59c398c3bc5d577a6e66c398c3bc5d577a6e5e7d6245
UHC Ø?]WznfØ?]Wzn^}YØ?]WznfØ?]Wzn^}bE 10101000101010100011111101011101010101110111101001101110011001101010100010101010001111110101110101010111011110100110111001011110011111010101100110101000101010100011111101011101010101110111101001101110011001101010100010101010001111110101110101010111011110100110111001011110011111010110001001000101 a8aa3f5d577a6e66a8aa3f5d577a6e5e7d59a8aa3f5d577a6e66a8aa3f5d577a6e5e7d6245

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (hexadecimal 3f).
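
A conversion like the one in the table can be sketched in Python. This is a minimal illustration, not the code behind this page: the function name encode_table and the mapping from the page's charset labels to Python codec names (for example UHC to cp949) are assumptions, and Python's codecs may differ from this page's converter for some characters. Unrepresentable characters are replaced with "?", as in the SJIS-WIN row.

```python
# Hypothetical mapping from the labels used on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {label: (decoded text, binary string, hex string)} for each charset."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, concatenated without spaces.
        bits = "".join(f"{byte:08b}" for byte in raw)
        # Decode again to show what survives the round trip.
        shown = raw.decode(codec, errors="replace")
        rows[label] = (shown, bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (shown, bits, hexa) in encode_table("\u00d8\u00fc").items():
        print(label, shown, bits, hexa)
```

For the two-character input "Øü", the UTF-8 row comes out as c398c3bc and the ISO-8859-1 row as d8fc, matching the leading bytes of the corresponding rows in the table.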