To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 壤?????醫??壤??泣①?魏??永?? 10011010110111110011111100111111001111110011111100111111111001111100111000111111001111111001101011011111001111110011111110001011100000111000011101000000001111111110100110110000001111110011111110001001011010010011111100111111 9adf3f3f3f3f3fe7ce3f3f9adf3f3f8b8387403fe9b03f3f89693f3f
EUC-JP 壤??堉??醫??壤??泣??魏??永?? 1101010011100001001111110011111110001111101101111111110100111111001111111110111011010000001111110011111111010100111000010011111100111111101101011110001100111111001111111111001010110010001111110011111110110001110010100011111100111111 d4e13f3f8fb7fd3f3feed03f3fd4e13f3fb5e33f3ff2b23f3fb1ca3f3f
UTF-8 壤깆쥉堉삣쉽醫꾪뮎壤깆쥜泣①독魏됱댅永띕복 111001011010001110100100111010101011100110000110111011001010010110001001111001011010000010001001111011001000001010100011111011001000100110111101111010011000011010101011111010101011111010101010111010111010111010001110111001011010001110100100111010101011100110000110111011001010010110011100111001101011001110100011111000101001000110100000111010111000111110000101111010011010110110001111111010111001000010110001111010111000110010000101111001101011000010111000111010111001110110010101111010111011001110110101 e5a3a4eab986eca589e5a089ec82a3ec89bde986abeabeaaebae8ee5a3a4eab986eca59ce6b3a3e291a0eb8f85e9ad8feb90b1eb8c85e6b0b8eb9d95ebb3b5
UHC 壤깆쥉堉삣쉽醫꾪뮎壤깆쥜泣①독魏됱댅永띕복 111001011011110110110001111011001010001010000010111010111011110010111011111001011011110110110001111011001010001010000100111011011001001010011011111001011011110110110001111011001010001010010001111010111110100010101000111001111011010110110110111010101110000010001001111011001000100010101111111001111011010110110110111010111011101010111001 e5bdb1eca282ebbcbbe5bdb1eca284ed929be5bdb1eca291ebe8a8e7b5b6eae089ec88afe7b5b6ebbab9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)