To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????猷⑤?歪??蟻??語⑤9魏 001111110011111100111111001111110011111100111111100101110101000110000111010001000011111110011000011000110011111100111111100010110110000100111111001111111000110011101010100001110100010010000010010110001110100110110000 3f3f3f3f3f3f975187443f98633f3f8b613f3f8cea87448258e9b0
EUC-JP ???靷??猷??歪??蟻??語?9魏 001111110011111100111111100011111110011110111101001111110011111111001101101100100011111100111111110011111100010000111111001111111011010111000010001111110011111110111000111011000011111110100011101110011111001010110010 3f3f3f8fe7bd3f3fcdb23f3fcfc43f3fb5c23f3fb8ec3fa3b9f2b2
UTF-8 僚녹뼔靷뽫댚猷⑤븶歪묎쉈蟻뤿븶語⑤9魏 111011111010011010111011111010111000010110111001111010111011110010010100111010011001110110110111111010111011110110101011111010111000110010011010111001111000110010110111111000101001000110100100111010111011100010110110111001101010110110101010111010111010110010001110111011001000100110001000111010001001111110111011111010111010010010111111111010111011100010110110111010001010101010011110111000101001000110100100111011111011110010011001111010011010110110001111 efa6bbeb85b9ebbc94e99db7ebbdabeb8c9ae78cb7e291a4ebb8b6e6adaaebac8eec8988e89fbbeba4bfebb8b6e8aa9ee291a4efbc99e9ad8f
UHC 僚녹뼔靷뽫댚猷⑤븶歪묎쉈蟻뤿븶語⑤9魏 1110100011101000101100111110110010010110100111001110110011100110100101101110011110001000101111101110101110100011101010001110101110010101100111111110100011100000100100011110101010111101101001011110101111111100100011111110101110010101100111111110010111011110101010001110101110100011101110011110101011100000 e8e8b3ec969cece696e788beeba3a8eb959fe8e091eabda5ebfc8feb959fe5dea8eba3b9eae0
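The rows above can be approximated offline. The following is a minimal sketch, not the tool's actual implementation: it assumes Python's built-in codecs `cp932` and `cp949` are close enough stand-ins for SJIS-WIN and UHC, and that unencodable characters should become "?" (0x3f), as the ISO-8859-1 row suggests. The names `CODECS` and `encode_row` are hypothetical.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, charset):
    """Return (binary bit string, hexadecimal bit string) for text in charset."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset
    # cannot represent, instead of raising UnicodeEncodeError.
    raw = text.encode(CODECS[charset], errors="replace")
    # Each byte becomes exactly eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    for name in CODECS:
        binary, hexadecimal = encode_row("魏", name)
        print(name, binary, hexadecimal)
```

Results may differ from the table for characters where a codec's mapping table differs from the one the page uses.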

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
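To illustrate the note, the short sketch below encodes one Japanese character in both legacy charsets and shows that the byte sequences differ. It assumes Python's `cp932` codec corresponds to SJIS-Win. The sample character and the helper name `japanese_bytes` are chosen only for illustration.

```python
def japanese_bytes(text):
    """Return the hex bytes of text in the two legacy Japanese charsets."""
    return {
        "SJIS-Win": text.encode("cp932").hex(),   # Windows code page 932
        "EUC-JP": text.encode("euc_jp").hex(),    # common on UNIX systems
    }


if __name__ == "__main__":
    # HIRAGANA LETTER A: the same character, two different byte sequences.
    print(japanese_bytes("あ"))
```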