To which bit string is a character (or string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???苒????????苒???????? 0011111100111111001111111110010010010010001111110011111100111111001111110011111100111111001111110011111111100100100100100011111100111111001111110011111100111111001111110011111100111111 3f3f3fe4923f3f3f3f3f3f3f3fe4923f3f3f3f3f3f3f3f
EUC-JP 琰??苒?????琰??苒?????琰?? 1000111111001100101101000011111100111111111001111111001000111111001111110011111100111111001111111000111111001100101101000011111100111111111001111111001000111111001111110011111100111111001111111000111111001100101101000011111100111111 8fccb43f3fe7f23f3f3f3f3f8fccb43f3fe7f23f3f3f3f3f8fccb43f3f
UTF-8 琰사뒧苒믥뒮劣사뒧琰사뒧苒믥뒮劣사즿琰사뒧 111001111001000010110000111011001000001010101100111010111001001010100111111010001000101110010010111010111010111110100101111010111001001010101110111011111010011010011101111011001000001010101100111010111001001010100111111001111001000010110000111011001000001010101100111010111001001010100111111010001000101110010010111010111010111110100101111010111001001010101110111011111010011010011101111011001000001010101100111011001010011010111111111001111001000010110000111011001000001010101100111010111001001010100111 e790b0ec82aceb92a7e88b92ebafa5eb92aeefa69dec82aceb92a7e790b0ec82aceb92a7e88b92ebafa5eb92aeefa69dec82aceca6bfe790b0ec82aceb92a7
UHC 琰사뒧苒믥뒮劣사뒧琰사뒧苒믥뒮劣사즿琰사뒧 111001101111110010111011111001111000101010100010111001101111111010010010111001111000101010100111111001101110101110111011111001111000101010100010111001101111110010111011111001111000101010100010111001101111111010010010111001111000101010100111111001101110101110111011111001111010001110010001111001101111110010111011111001111000101010100010 e6fcbbe78aa2e6fe92e78aa7e6ebbbe78aa2e6fcbbe78aa2e6fe92e78aa7e6ebbbe7a391e6fcbbe78aa2

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
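
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function name `to_bits` and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which is the behavior the table suggests.

```python
# Hypothetical mapping from the labels used on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(text, charset):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    codec = CODECS[charset]
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return raw.decode(codec, errors="replace"), binary, raw.hex()


if __name__ == "__main__":
    for name in CODECS:
        shown, binary, hexa = to_bits("苒", name)
        print(name, shown, binary, hexa)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.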