To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 瘟??熬??泳??鳶??耶??兀??鳶?? 11100001100010010011111100111111111000001001001000111111001111111000100101101010001111110011111110010011110011100011111100111111100101101110101100111111001111111001100101011001001111110011111110010011110011100011111100111111 e1893f3fe0923f3f896a3f3f93ce3f3f96eb3f3f99593f3f93ce3f3f
EUC-JP 瘟??熬??泳??鳶??耶??兀??鳶?? 11100001111010010011111100111111110111111111001000111111001111111011000111001011001111110011111111000110110100000011111100111111110011001110110100111111001111111101000110111010001111110011111111000110110100000011111100111111 e1e93f3fdff23f3fb1cb3f3fc6d03f3fcced3f3fd1ba3f3fc6d03f3f
UTF-8 瘟룩큹熬뽬꽦泳싦퓘鳶멩짎耶섋갬兀덂뜏鳶멱쥤 111001111001100010011111111010111010001110101001111011011000000110111001111001111000011010101100111010111011110110101100111010101011110110100110111001101011001110110011111011001000101110100110111011011001001110011000111010011011001110110110111010111010100110101001111011001010011110001110111010001000000010110110111011001000010010001011111010101011000010101100111001011000010110000000111010111000110110000010111010111001110010001111111010011011001110110110111010111010100110110001111011001010010110100100 e7989feba3a9ed81b9e786acebbdaceabda6e6b3b3ec8ba6ed9398e9b3b6eba9a9eca78ee880b6ec848beab0ace58580eb8d82eb9c8fe9b3b6eba9b1eca5a4
UHC 瘟룩큹熬뽬꽦泳싦퓘鳶멩짎耶섋갬兀덂뜏鳶멱쥤 111010001011000010110111111010001011010010001000111010001010001010010110111010001000010010110001111001111011011010011010111001001011111110000011111001101110100110111000111001101010001110011010111001011010110110011000111010001011000010110111111010001011010010001000111001011000110110010010111001101110100110111000111010001010001010010110 e8b0b7e8b488e8a296e884b1e7b69ae4bf83e6e9b8e6a39ae5ad98e8b0b7e8b488e58d92e6e9b8e8a296

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)