To which bit string is a character (or a short string of characters) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
EUC-JP ???????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
UTF-8 梨좏삺吏쬩梨좏삺吏ㅼ콡吏좏삷梨좏삺吏ㅼ콡吏ㅽ삢B 11101111101001111010001011101100101000101000111111101100100000101011101011101111101001111001111011101100101011001010100111101111101001111010001011101100101000101000111111101100100000101011101011101111101001111001111011100011100001011011110011101100101111011010000111101111101001111001111011101100101000101000111111101100100000101011011111101111101001111010001011101100101000101000111111101100100000101011101011101111101001111001111011100011100001011011110011101100101111011010000111101111101001111001111011100011100001011011110111101100100000101010001001000010 efa7a2eca28fec82baefa79eecaca9efa7a2eca28fec82baefa79ee385bcecbda1efa79eeca28fec82b7efa7a2eca28fec82baefa79ee385bcecbda1efa79ee385bdec82a242
UHC 梨좏삺吏쬩梨좏삺吏ㅼ콡吏좏삷梨좏삺吏ㅼ콡吏ㅽ삢B 1110110010110001101000001110110110011000101100011110110010100111101001110101101011101100101100011010000011101101100110001011000111101100101001111010010011101100101100011001100111101100101001111010000011101101100110001010111011101100101100011010000011101101100110001011000111101100101001111010010011101100101100011001100111101100101001111010010011101101100110001010001101000010 ecb1a0ed98b1eca7a75aecb1a0ed98b1eca7a4ecb199eca7a0ed98aeecb1a0ed98b1eca7a4ecb199eca7a4ed98a342

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
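
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) and the helper names encode_row and encode_table are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the rows of 3f bytes in the table reflect.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as the note states
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become "?" (0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset of CODECS, keyed by charset label."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("B").items():
        print(label, bits, hex_string)
```

For the single letter "B", every charset listed here yields the same byte, 01000010 (hex 42), which matches the final byte of each row in the table.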