To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??旬??淫??筌??違∽?音??筌 111000101010001100111111001111111000111101111011001111110011111110001000111110100011111100111111111000101010001100111111001111111000100011100001100000011110010000111111100010011011100100111111001111111110001010100011 e2a33f3f8f7b3f3f88fa3f3fe2a33f3f88e181e43f89b93f3fe2a3
EUC-JP 筌??旬??淫??筌??違∽?音??筌 111001001010010100111111001111111011110111011100001111110011111110110000111111000011111100111111111001001010010100111111001111111011000011100011101000101110011000111111101100101011101100111111001111111110010010100101 e4a53f3fbddc3f3fb0fc3f3fe4a53f3fb0e3a2e63fb2bb3f3fe4a5
UTF-8 筌뚮뱶旬썽씣淫묐윫筌뗫끃違∽쭓音뚰뮍筌 111001111010110110001100111010111001101010101110111010111011000110110110111001101001011110101100111011001000110110111101111011001001010010100011111001101011011110101011111010111010110010010000111011001001110010101011111001111010110110001100111010111001011110101011111010111000000110000011111010011000000110010101111000101000100010111101111011001010110110010011111010011001111110110011111010111001101010110000111010111010111010001101111001111010110110001100 e7ad8ceb9aaeebb1b6e697acec8dbdec94a3e6b7abebac90ec9cabe7ad8ceb97abeb8183e98195e288bdecad93e99fb3eb9ab0ebae8de7ad8c
UHC 筌뚮뱶旬썽씣淫묐윫筌뗫끃違∽쭓音뚰뮍筌 1110111110100111100011001110101110010011100111001110001011100010101111011110100110011101101101111110101111100010100100011110101110011111101010101110111110100111100010111110101110000101101110011110101011011110101000011110111110100111100010111110101111100101100011001110110110010010100110101110111110100111 efa78ceb939ce2e2bde99db7ebe291eb9faaefa78beb85b9eadea1efa78bebe58ced929aefa7

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
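
The conversion this page performs can be approximated in a few lines of Python. This is only a sketch and not the page's actual implementation: the function name `to_bit_strings` is hypothetical, the codec names are Python's (`cp932` for SJIS-Win, `cp949` for UHC), and it assumes that characters a charset cannot represent are replaced with `?` (byte `3f`), which would be consistent with the `3f` bytes in the table above.

```python
def to_bit_strings(text, charset):
    """Encode text in the given charset; return (binary, hexadecimal) strings.

    Characters the charset cannot represent are replaced with '?' (0x3f).
    That replacement policy is an assumption, not confirmed tool behavior.
    """
    data = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Python codec names assumed to correspond to the charsets listed above.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    sample = "あ"  # any single character or short string
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

A Japanese character such as "あ" encodes to two bytes in SJIS-Win and EUC-JP and to three bytes in UTF-8, while ISO-8859-1 has no code for it and falls back to the single replacement byte.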