To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 恁??姨??夷???m?恁??姨??夷???m?B 100111001000110000111111001111111001101101001000001111110011111110001000110011100011111100111111001111111000001010001101001111111001110010001100001111110011111110011011010010000011111100111111100010001100111000111111001111110011111110000010100011010011111101000010 9c8c3f3f9b483f3f88ce3f3f3f828d3f9c8c3f3f9b483f3f88ce3f3f3f828d3f42
EUC-JP 恁??姨??夷???m?恁??姨??夷???m?B 110101111110110000111111001111111101010110101001001111110011111110110000110100000011111100111111001111111010001111101101001111111101011111101100001111110011111111010101101010010011111100111111101100001101000000111111001111110011111110100011111011010011111101000010 d7ec3f3fd5a93f3fb0d03f3f3fa3ed3fd7ec3f3fd5a93f3fb0d03f3f3fa3ed3f42
UTF-8 恁㏉쉵姨먯껨夷붿콛淋m슃恁㏉쉵姨먯껨夷붿콛淋m슃B 11100110100000011000000111100011100011111000100111101100100010011011010111100101101001111010100011101011101010001010111111101010101110111010100011100101101001001011011111101011101101101011111111101100101111011001101111101111101001111011010111101111101111011000110111101100100010101000001111100110100000011000000111100011100011111000100111101100100010011011010111100101101001111010100011101011101010001010111111101010101110111010100011100101101001001011011111101011101101101011111111101100101111011001101111101111101001111011010111101111101111011000110111101100100010101000001101000010 e68181e38f89ec89b5e5a7a8eba8afeabba8e5a4b7ebb6bfecbd9befa7b5efbd8dec8a83e68181e38f89ec89b5e5a7a8eba8afeabba8e5a4b7ebb6bfecbd9befa7b5efbd8dec8a8342
UHC 恁㏉쉵姨먯껨夷붿콛淋m슃恁㏉쉵姨먯껨夷붿콛淋m슃B 11101100111101101010011111101101100110101000101111101100101010011001000011101100101100101011010111101100101010001001010011101100101100011001010011101100111110001010001111101101100110101001010111101100111101101010011111101101100110101000101111101100101010011001000011101100101100101011010111101100101010001001010011101100101100011001010011101100111110001010001111101101100110101001010101000010 ecf6a7ed9a8beca990ecb2b5eca894ecb194ecf8a3ed9a95ecf6a7ed9a8beca990ecb2b5eca894ecb194ecf8a3ed9a9542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)