To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???H???????????????UB 001111110011111100111111010010000011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101010101000010 3f3f3f483f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5542
SJIS-WIN 狸旦捉H狸旦息狸則属狸旦息狸族存狸旦捉UB 100100100100101110010010010101011001000110101000010010001001001001001011100100100101010110010001101001111001001001001011100100011010010110010001101011101001001001001011100100100101010110010001101001111001001001001011100100011011000010010001101101101001001001001011100100100101010110010001101010000101010101000010 924b925591a848924b925591a7924b91a591ae924b925591a7924b91b091b6924b925591a85542
EUC-JP 狸旦捉H狸旦息狸則属狸旦息狸族存狸旦捉UB 110000111010110011000011101101101100001010101010010010001100001110101100110000111011011011000010101010011100001110101100110000101010011111000010101100001100001110101100110000111011011011000010101010011100001110101100110000101011001011000010101110001100001110101100110000111011011011000010101010100101010101000010 c3acc3b6c2aa48c3acc3b6c2a9c3acc2a7c2b0c3acc3b6c2a9c3acc2b2c2b8c3acc3b6c2aa5542
UTF-8 狸旦捉H狸旦息狸則属狸旦息狸族存狸旦捉UB 111001111000101110111000111001101001011110100110111001101000110110001001010010001110011110001011101110001110011010010111101001101110011010000001101011111110011110001011101110001110010110001001100001111110010110110001100111101110011110001011101110001110011010010111101001101110011010000001101011111110011110001011101110001110011010010111100011111110010110101101100110001110011110001011101110001110011010010111101001101110011010001101100010010101010101000010 e78bb8e697a6e68d8948e78bb8e697a6e681afe78bb8e58987e5b19ee78bb8e697a6e681afe78bb8e6978fe5ad98e78bb8e697a6e68d895542
UHC 狸旦捉H狸旦息狸則?狸旦息狸族存狸旦捉UB 1101011111100001110100111010100111110011101101010100100011010111111000011101001110101001111000111101001111010111111000011111011011001110001111111101011111100001110100111010100111100011110100111101011111100001111100001110100111110000111011011101011111100001110100111010100111110011101101010101010101000010 d7e1d3a9f3b548d7e1d3a9e3d3d7e1f6ce3fd7e1d3a9e3d3d7e1f0e9f0edd7e1d3a9f3b55542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)