To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??爰???≪?獄?????惟??亦 11100001100111110011111100111111111000001010011100111111001111110011111110000001111000010011111110001101100101100011111100111111001111110011111100111111100010001101001000111111001111111001011010010010 e19f3f3fe0a73f3f3f81e13f8d963f3f3f3f3f88d23f3f9692
EUC-JP 癲??爰??飡≪?獄??嫄??惟??亦 1110001010100001001111110011111111100000101010010011111100111111100011111110100011001000101000101110001100111111101110011111011000111111001111111000111110111010101000010011111100111111101100001101010000111111001111111100101111110010 e2a13f3fe0a93f3f8fe8c8a2e33fb9f63f3f8fbaa13f3fb0d43f3fcbf2
UTF-8 癲ㅺ퓭爰귝끽飡≪를獄쏄껸嫄띄뙠惟곌턂亦 111001111001100110110010111000111000010110111010111011011001001110101101111001111000100010110000111010101011011110011101111010111000000110111101111010011010001110100001111000101000100110101010111010111010010110111100111001111000110110000100111011001000111110000100111010101011101110111000111001011010101110000100111010111001110110000100111010111001100110100000111001101000001110011111111010101011001110001100111011011000010010000010111001001011101010100110 e799b2e385baed93ade788b0eab79deb81bde9a3a1e289aaeba5bce78d84ec8f84eabbb8e5ab84eb9d84eb99a0e6839feab38ced8482e4baa6
UHC 癲ㅺ퓭爰귝끽飡≪를獄쏄껸嫄띄뙠惟곌턂亦 1110111110100110101001001110101010111111100101001110101010111010100000101110011010110011101000111110000111100010101000011110110010111000101001101110100010101011100110111110101010110010101110011110101010110001101101101110011110001100101001011110101011101110101100001110101010110101100111101110011010110010 efa6a4eabf94eaba82e6b3a3e1e2a1ecb8a6e8ab9beab2b9eab1b6e78ca5eaeeb0eab59ee6b2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)