To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 崧而岦失崧而岦失n}崧而岦失崧而岦失n{^ 11111010101011111000111010100111111110101010110010001110101110001111101010101111100011101010011111111010101011001000111010111000011011100111110111111010101011111000111010100111111110101010110010001110101110001111101010101111100011101010011111111010101011001000111010111000011011100111101101011110 faaf8ea7faac8eb8faaf8ea7faac8eb86e7dfaaf8ea7faac8eb8faaf8ea7faac8eb86e7b5e
EUC-JP 崧而岦失崧而岦失n}崧而岦失崧而岦失n{^ 100011111011101111001010101111001010100110001111101110111011001110111100101110101000111110111011110010101011110010101001100011111011101110110011101111001011101001101110011111011000111110111011110010101011110010101001100011111011101110110011101111001011101010001111101110111100101010111100101010011000111110111011101100111011110010111010011011100111101101011110 8fbbcabca98fbbb3bcba8fbbcabca98fbbb3bcba6e7d8fbbcabca98fbbb3bcba8fbbcabca98fbbb3bcba6e7b5e
UTF-8 崧而岦失崧而岦失n}崧而岦失崧而岦失n{^ 1110010110110100101001111110100010000000100011001110010110110010101001101110010110100100101100011110010110110100101001111110100010000000100011001110010110110010101001101110010110100100101100010110111001111101111001011011010010100111111010001000000010001100111001011011001010100110111001011010010010110001111001011011010010100111111010001000000010001100111001011011001010100110111001011010010010110001011011100111101101011110 e5b4a7e8808ce5b2a6e5a4b1e5b4a7e8808ce5b2a6e5a4b16e7de5b4a7e8808ce5b2a6e5a4b1e5b4a7e8808ce5b2a6e5a4b16e7b5e
UHC 崧而?失崧而?失n}崧而?失崧而?失n{^ 111000101111111011101100101110110011111111100011111101111110001011111110111011001011101100111111111000111111011101101110011111011110001011111110111011001011101100111111111000111111011111100010111111101110110010111011001111111110001111110111011011100111101101011110 e2feecbb3fe3f7e2feecbb3fe3f76e7de2feecbb3fe3f7e2feecbb3fe3f76e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)