To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???寃??淫??腰??幼??幼????? 0011111100111111001111111001101110000011001111110011111110001000111110100011111100111111100011011001100000111111001111111001011101100011001111110011111110010111011000110011111100111111001111110011111100111111 3f3f3f9b833f3f88fa3f3f8d983f3f97633f3f97633f3f3f3f3f
EUC-JP ???寃??淫??腰??幼??幼????? 0011111100111111001111111101010111100011001111110011111110110000111111000011111100111111101110011111100000111111001111111100110111000100001111110011111111001101110001000011111100111111001111110011111100111111 3f3f3fd5e33f3fb0fc3f3fb9f83f3fcdc43f3fcdc43f3f3f3f3f
UTF-8 列룸떣寃덃걗淫롳폋腰밟븞幼껆뙴幼곷뭅淋⑴뙴 111011111010011010011100111010111010001110111000111010111001011010100011111001011010111110000011111010111000110110000011111010101011000110010111111001101011011110101011111010111010000110110011111011011000111110001011111010001000010110110000111010111011000010011111111010111011100010011110111001011011100110111100111010101011101110000110111010111001100110110100111001011011100110111100111010101011001110110111111010111010110110000101111011111010011110110101111000101001000110110100111010111001100110110100 efa69ceba3b8eb96a3e5af83eb8d83eab197e6b7abeba1b3ed8f8be885b0ebb09febb89ee5b9bceabb86eb99b4e5b9bceab3b7ebad85efa7b5e291b4eb99b4
UHC 列룸떣寃덃걗淫롳폋腰밟븞幼껆뙴幼곷뭅淋⑴뙴 111001101110101010110111111010111000101110110111111010101011001010001000111001101000000110000010111010111110001010001110111011111011110010010110111010011010011010111001111000101001010110001000111010101110101010000011111001111000110010110111111010101110101010000001111010111011100110110100111011001111100010101001111001111000110010110111 e6eab7eb8bb7eab288e68182ebe28eefbc96e9a6b9e29588eaea83e78cb7eaea81ebb9b4ecf8a9e78cb7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)