Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???節g?謠??嵬???????ぜ節??^ 00111111001111110011111110010000110111111000001010000111001111111110011010001111001111110011111110011011110010100011111100111111001111110011111100111111001111110011111110000010101110101001000011011111001111110011111101011110 3f3f3f90df82873fe68f3f3f9bca3f3f3f3f3f3f3f82ba90df3f3f5e
EUC-JP ???節g?謠??嵬???????ぜ節??^ 00111111001111110011111111000000111000011010001111100111001111111110101111101111001111110011111111010110110011000011111100111111001111110011111100111111001111110011111110100100101111001100000011100001001111110011111101011110 3f3f3fc0e1a3e73febef3f3fd6cc3f3f3f3f3f3f3fa4bcc0e13f3f5e
UTF-8 簾앻맕節g줉謠쇽슝嵬뚨줉狀㏆숲簾앶ぜ節밭겮^ 11101111101001101010011011101100100101011011101111101011101001111001010111100111101011111000000011101111101111011000011111101100101001001000100111101000101011001010000011101100100001111011110111101100100010101001110111100101101101011010110011101011100110101010100011101100101001001000100111101111101001111011101011100011100011111000011011101100100010001011001011101111101001101010011011101100100101011011011011100011100000011001110011100111101011111000000011101011101100001010110111101010101100101010111001011110 efa6a6ec95bbeba795e7af80efbd87eca489e8aca0ec87bdec8a9de5b5aceb9aa8eca489efa7bae38f86ec88b2efa6a6ec95b6e3819ce7af80ebb0adeab2ae5e
UHC 簾앻맕節g줉謠쇽슝嵬뚨줉狀㏆숲簾앶ぜ節밭겮^ 11100111101000011001110111101110100100001010011111101111101111011010001111100111101000011001110111101001101010101011110011101111101111011011100111101000111000111000110011100111101000011001110111101101111011101010011111101111101111011010001111100111101000011001110111101001101010101011110011101111101111011011100111100111100000011011110001011110 e7a19dee90a7efbda3e7a19de9aabcefbdb9e8e38ce7a19dedeea7efbda3e7a19de9aabcefbdb9e781bc5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). In the table, a character that a charset cannot represent appears as "?" (byte 3f).
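
A conversion like the one in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, and the names `CHARSETS`, `encode_row` and `convert` are hypothetical. Characters a charset cannot represent are replaced with "?" (byte 3f), which matches the ISO-8859-1 row above.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one line per charset: label, bit string, hex string.
    for label, (bits, hex_string) in convert("節^").items():
        print(label, bits, hex_string)
```

Using `errors="replace"` rather than the default strict mode keeps the sketch from raising `UnicodeEncodeError` when the input contains characters outside a given charset.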