What bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??^}??^u???TL??^}??^u???TL^ 001111110011111101011110011111010011111100111111010111100111010100111111001111110011111101010100010011000011111100111111010111100111110100111111001111110101111001110101001111110011111100111111010101000100110001011110 3f3f5e7d3f3f5e753f3f3f544c3f3f5e7d3f3f5e753f3f3f544c5e
SJIS-WIN ??^}??^u???TL??^}??^u???TL^ 001111110011111101011110011111010011111100111111010111100111010100111111001111110011111101010100010011000011111100111111010111100111110100111111001111110101111001110101001111110011111100111111010101000100110001011110 3f3f5e7d3f3f5e753f3f3f544c3f3f5e7d3f3f5e753f3f3f544c5e
EUC-JP ??^}??^u???TL??^}??^u???TL^ 001111110011111101011110011111010011111100111111010111100111010100111111001111110011111101010100010011000011111100111111010111100111110100111111001111110101111001110101001111110011111100111111010101000100110001011110 3f3f5e7d3f3f5e753f3f3f544c3f3f5e7d3f3f5e753f3f3f544c5e
UTF-8 셔샹^}셔샹^u셍롉렢TL셔샹^}셔샹^u셍롉렢TL^ 11101100100001011001010011101100100000111011100101011110011111011110110010000101100101001110110010000011101110010101111001110101111011001000010110001101111010111010000110001001111010111010000010100010010101000100110011101100100001011001010011101100100000111011100101011110011111011110110010000101100101001110110010000011101110010101111001110101111011001000010110001101111010111010000110001001111010111010000010100010010101000100110001011110 ec8594ec83b95e7dec8594ec83b95e75ec858deba189eba0a2544cec8594ec83b95e7dec8594ec83b95e75ec858deba189eba0a2544c5e
UHC 셔샹^}셔샹^u셍롉렢TL셔샹^}셔샹^u셍롉렢TL^ 1011110011000101101111001010011101011110011111011011110011000101101111001010011101011110011101011011110011000100100011101100111110001110101100110101010001001100101111001100010110111100101001110101111001111101101111001100010110111100101001110101111001110101101111001100010010001110110011111000111010110011010101000100110001011110 bcc5bca75e7dbcc5bca75e75bcc48ecf8eb3544cbcc5bca75e7dbcc5bca75e75bcc48ecf8eb3544c5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
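
The same kind of conversion can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name `encode_report` is hypothetical, and the mapping of the table's charset names to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) is an assumption. Unencodable characters are replaced with "?" (byte 3f), which matches what the ISO-8859-1, SJIS-WIN and EUC-JP rows above show for Hangul input.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return a list of (charset, binary string, hex string) rows for text."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexed in encode_report("셔^"):
        print(label, bits, hexed)
```

For the two-character input `셔^`, the UTF-8 row should give `ec85945e` and the ISO-8859-1 row `3f5e`, consistent with the leading and trailing bytes of the corresponding rows in the table.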