What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鬩崎娼讀ヲ蠅橸スウ鬩崎娼讀ヲ蠅橸スウB 11101001101010011000110111101000100011111010100111100110101001001010011011100101101000101001111011101111101111011011001111101001101010011000110111101000100011111010100111100110101001001010011011100101101000101001111011101111101111011011001101000010 e9a98de88fa9e6a4a6e5a29eefbdb3e9a98de88fa9e6a4a6e5a29eefbdb342
EUC-JP 鬩崎娼讀ヲ蠅橸スウ鬩崎娼讀ヲ蠅橸スウB 11110010101010111011101011101010101111101010101111101100101001101000111010100110111010101010010011011100111100011000111010111101100011101011001111110010101010111011101011101010101111101010101111101100101001101000111010100110111010101010010011011100111100011000111010111101100011101011001101000010 f2abbaeabeabeca68ea6eaa4dcf18ebd8eb3f2abbaeabeabeca68ea6eaa4dcf18ebd8eb342
UTF-8 鬩崎娼讀ヲ蠅橸スウ鬩崎娼讀ヲ蠅橸スウB 11101001101011001010100111100101101101001000111011100101101010001011110011101000101011101000000011101111101111011010011011101000101000001000010111100110101010011011100011101111101111011011110111101111101111011011001111101001101011001010100111100101101101001000111011100101101010001011110011101000101011101000000011101111101111011010011011101000101000001000010111100110101010011011100011101111101111011011110111101111101111011011001101000010 e9aca9e5b48ee5a8bce8ae80efbda6e8a085e6a9b8efbdbdefbdb3e9aca9e5b48ee5a8bce8ae80efbda6e8a085e6a9b8efbdbdefbdb342
UHC ?崎娼讀?蠅????崎娼讀?蠅???B 001111111101000011111000111100111101111011010100110000010011111111100011101100100011111100111111001111110011111111010000111110001111001111011110110101001100000100111111111000111011001000111111001111110011111101000010 3fd0f8f3ded4c13fe3b23f3f3f3fd0f8f3ded4c13fe3b23f3f3f42
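A conversion like the one in the table above can be sketched in Python. This is only an illustration, not the code behind this page: the function name `to_bit_strings`, the `CHARSETS` mapping, and the sample input "あB" are hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumed to correspond to the charsets listed here. Characters that a charset cannot represent are replaced with "?" (hex 3f).

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    data = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    sample = "あB"  # hypothetical input: one hiragana letter followed by ASCII "B"
    for name, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(name, binary, hexadecimal)
```

In this sketch the trailing "B" comes out as 01000010 (hex 42) under every codec, just as each row of the table ends in 42, while the bytes for the non-ASCII characters differ from charset to charset.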

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).