What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????????洵℡?雅??瘟???よ。 001111110011111100111111001111110011111100111111001111110011111100111111100111111010101110000111100001000011111110001001111010110011111100111111111000011000100100111111001111110011111110000010111001101000000101000010 3f3f3f3f3f3f3f3f3f9fab87843f89eb3f3fe1893f3f3f82e68142
EUC-JP 獒????????洵??雅??瘟???よ。 10001111110010111011101100111111001111110011111100111111001111110011111100111111001111111101111010101101001111110011111110110010111011010011111100111111111000011110100100111111001111110011111110100100111010001010000110100011 8fcbbb3f3f3f3f3f3f3f3fdead3f3fb2ed3f3fe1e93f3f3fa4e8a1a3
UTF-8 獒쎈젵料ㅺ퀋溜롥굄洵℡옡雅섎졁瘟욘쵊溜よ。 111001111000110110010010111011001000111010001000111011001010000010110101111011111010011010111110111000111000010110111010111011011000000010001011111011111010011110001011111010111010000110100101111010101011010110000100111001101011010010110101111000101000010010100001111011001001100010100001111010011001101110000101111011001000010010001110111011001010000110000001111001111001100010011111111011001001101010011000111011001011010110001010111011111010011110001011111000111000001010001000111000111000000010000010 e78d92ec8e88eca0b5efa6bee385baed808befa78beba1a5eab584e6b4b5e284a1ec98a1e99b85ec848eeca181e7989fec9a98ecb58aefa78be38288e38082
UHC 獒쎈젵料ㅺ퀋溜롥굄洵℡옡雅섎졁瘟욘쵊溜よ。 111010001010001110111101111010111010000010101001111010001111011110100100111010101011001110000001111010101111111010001110111001011011000110101111111000101110011110100010111001011001111010100011111001001011101010011000111010111010000010110010111010001011000010111111111001101010110010001100111010101111111010101010111010001010000110100011 e8a3bdeba0a9e8f7a4eab381eafe8ee5b1afe2e7a2e59ea3e4ba98eba0b2e8b0bfe6ac8ceafeaae8a1a3

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
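
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_row` and the `CODECS` mapping are hypothetical, and the mapping assumes that Python's `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` codecs correspond to the ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8, and UHC rows. Individual characters may be mapped differently by other converters, so results can differ from the table for some inputs.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: these codecs behave close to the ones used by the page.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the charset cannot represent are replaced with "?" (0x3f),
    which is why rows for unsuitable charsets fill up with 3f bytes.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "\u3088\u3002"  # the last two characters of the sample input
    for label, codec in CODECS.items():
        bits, hexed = encode_row(sample, codec)
        print(label, bits, hexed)
```

For the two-character sample used here, the UTF-8 result is `e38288e38082`, which matches the final six bytes of the UTF-8 row above, and the ISO-8859-1 result is `3f3f` because neither character exists in that charset.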