To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 寃?├熊耕???嬪}寃?├熊耕???嬪{^ 10011011100000110011111110000100101001011000110001000110100011010110101100111111001111110011111110011011011011000111110110011011100000110011111110000100101001011000110001000110100011010110101100111111001111110011111110011011011011000111101101011110 9b833f84a58c468d6b3f3f3f9b6c7d9b833f84a58c468d6b3f3f3f9b6c7b5e
EUC-JP 寃?├熊耕???嬪}寃?├熊耕???嬪{^ 11010101111000110011111110101000101001111011011110100111101110011100110000111111001111110011111111010101110011010111110111010101111000110011111110101000101001111011011110100111101110011100110000111111001111110011111111010101110011010111101101011110 d5e33fa8a7b7a7b9cc3f3f3fd5cd7dd5e33fa8a7b7a7b9cc3f3f3fd5cd7b5e
UTF-8 寃양├熊耕ㄶ炡멸嬪}寃양├熊耕ㄶ炡멸嬪{^ 111001011010111110000011111011001001011010010001111000101001010010011100111001111000011010001010111010001000000010010101111000111000010010110110111001111000001010100001111010111010100110111000111001011010110010101010011111011110010110101111100000111110110010010110100100011110001010010100100111001110011110000110100010101110100010000000100101011110001110000100101101101110011110000010101000011110101110101001101110001110010110101100101010100111101101011110 e5af83ec9691e2949ce7868ae88095e384b6e782a1eba9b8e5acaa7de5af83ec9691e2949ce7868ae88095e384b6e782a1eba9b8e5acaa7b5e
UHC 寃양├熊耕ㄶ炡멸嬪}寃양├熊耕ㄶ炡멸嬪{^ 111010101011001010111110111001111010011010100111111010101010100011001100111010011010010010100110111011111110100010111000111010101101111010101110011111011110101010110010101111101110011110100110101001111110101010101000110011001110100110100100101001101110111111101000101110001110101011011110101011100111101101011110 eab2bee7a6a7eaa8cce9a4a6efe8b8eadeae7deab2bee7a6a7eaa8cce9a4a6efe8b8eadeae7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)