What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ????????????????????×? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111101011100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fd73f
SJIS-WIN 壤?????轅??壤??泣①?袁l?壤?×裕 1001101011011111001111110011111100111111001111110011111111100111011101100011111100111111100110101101111100111111001111111000101110000011100001110100000000111111111001011100110110000010100011000011111110011010110111110011111110000001011111101001011101010100 9adf3f3f3f3f3fe7763f3f9adf3f3f8b8387403fe5cd828c3f9adf3f817e9754
EUC-JP 壤??堉??轅??壤??泣??袁l?壤?×裕 110101001110000100111111001111111000111110110111111111010011111100111111111011011101011100111111001111111101010011100001001111110011111110110101111000110011111100111111111010101100111110100011111011000011111111010100111000010011111110100001110111111100110110110101 d4e13f3f8fb7fd3f3fedd73f3fd4e13f3fb5e33f3feacfa3ec3fd4e13fa1dfcdb5
UTF-8 壤깆쥉堉사솈轅깅닅壤깆쥜泣①독袁l퐭壤깆×裕 1110010110100011101001001110101010111001100001101110110010100101100010011110010110100000100010011110110010000010101011001110110010000110100010001110100010111101100001011110101010111001100001011110101110001011100001011110010110100011101001001110101010111001100001101110110010100101100111001110011010110011101000111110001010010001101000001110101110001111100001011110100010100010100000011110111110111101100011001110110110010000101011011110010110100011101001001110101010111001100001101100001110010111111010001010001110010101 e5a3a4eab986eca589e5a089ec82acec8688e8bd85eab985eb8b85e5a3a4eab986eca59ce6b3a3e291a0eb8f85e8a281efbd8ced90ade5a3a4eab986c397e8a395
UHC 壤깆쥉堉사솈轅깅닅壤깆쥜泣①독袁l퐭壤깆×裕 1110010110111101101100011110110010100010100000101110101110111100101110111110011110011001100011001110101010111111101100011110101110001000100011101110010110111101101100011110110010100010100100011110101111101000101010001110011110110101101101101110101010111110101000111110110010111101100101101110010110111101101100011110110010100001101111111110101110101110 e5bdb1eca282ebbcbbe7998ceabfb1eb888ee5bdb1eca291ebe8a8e7b5b6eabea3ecbd96e5bdb1eca1bfebae

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
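
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The function name `bit_strings` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumptions made for illustration. It also assumes that characters a charset cannot represent are replaced with "?" (0x3f), which is what the 3f bytes in the table suggest.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption, not taken from the page itself.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text):
    """Return {charset label: (binary string, hex string)} for text."""
    result = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for characters the
        # charset cannot encode (assumed to mirror the page's behavior).
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        result[label] = (binary, data.hex())
    return result


if __name__ == "__main__":
    for label, (binary, hexadecimal) in bit_strings("×裕").items():
        print(label, binary, hexadecimal)
```

For the multiplication sign "×", this sketch yields d7 in ISO-8859-1, 817e in SJIS-Win, a1df in EUC-JP, and c397 in UTF-8, which agrees with the corresponding bytes in the table above.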