To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 壤??泣①?魏??永??泣①?袁l????裕? 100110101101111100111111001111111000101110000011100001110100000000111111111010011011000000111111001111111000100101101001001111110011111110001011100000111000011101000000001111111110010111001101100000101000110000111111001111110011111100111111100101110101010000111111 9adf3f3f8b8387403fe9b03f3f89693f3f8b8387403fe5cd828c3f3f3f3f97543f
EUC-JP 壤??泣??魏??永??泣??袁l????裕? 11010100111000010011111100111111101101011110001100111111001111111111001010110010001111110011111110110001110010100011111100111111101101011110001100111111001111111110101011001111101000111110110000111111001111110011111100111111110011011011010100111111 d4e13f3fb5e33f3ff2b23f3fb1ca3f3fb5e33f3feacfa3ec3f3f3f3fcdb53f
UTF-8 壤깆쥜泣①독魏됱뒴永띔쒀泣①독袁l퐭捻뚭여裕뉺 111001011010001110100100111010101011100110000110111011001010010110011100111001101011001110100011111000101001000110100000111010111000111110000101111010011010110110001111111010111001000010110001111010111001001010110100111001101011000010111000111010111001110110010100111011001001001010000000111001101011001110100011111000101001000110100000111010111000111110000101111010001010001010000001111011111011110110001100111011011001000010101101111011111010011010100100111010111001101010101101111011001001011110101100111010001010001110010101111010111000100110111010 e5a3a4eab986eca59ce6b3a3e291a0eb8f85e9ad8feb90b1eb92b4e6b0b8eb9d94ec9280e6b3a3e291a0eb8f85e8a281efbd8ced90adefa6a4eb9aadec97ace8a395eb89ba
UHC 壤깆쥜泣①독魏됱뒴永띔쒀泣①독袁l퐭捻뚭여裕뉺 11100101101111011011000111101100101000101001000111101011111010001010100011100111101101011011011011101010111000001000100111101100100010101010110111100111101101011011011011101010101111101010110011101011111010001010100011100111101101011011011011101010101111101010001111101100101111011001011011100110111101111000110011101010101111111010100111101011101011101000100001001010 e5bdb1eca291ebe8a8e7b5b6eae089ec8aade7b5b6eabeacebe8a8e7b5b6eabea3ecbd96e6f78ceabfa9ebae884a

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)