Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 﨑湿峵リ鰔」鈐蒔n}﨑湿峵リ鰔」鈐蒔n{^ 111110101011000110001110101111001111101010101110110110001110100111010000101000111111101111000010100011101010101001101110011111011111101010110001100011101011110011111010101011101101100011101001110100001010001111111011110000101000111010101010011011100111101101011110 fab18ebcfaaed8e9d0a3fbc28eaa6e7dfab18ebcfaaed8e9d0a3fbc28eaa6e7b5e
EUC-JP ?湿?リ鰔」鈐蒔n}?湿?リ鰔」鈐蒔n{^ 0011111110111100101111100011111110001110110110001111001011010010100011101010001110001111111000111100000110111100101011000110111001111101001111111011110010111110001111111000111011011000111100101101001010001110101000111000111111100011110000011011110010101100011011100111101101011110 3fbcbe3f8ed8f2d28ea38fe3c1bcac6e7d3fbcbe3f8ed8f2d28ea38fe3c1bcac6e7b5e
UTF-8 﨑湿峵リ鰔」鈐蒔n}﨑湿峵リ鰔」鈐蒔n{^ 1110111110101000100100011110011010111001101111111110010110110011101101011110111110111110100110001110100110110000100101001110111110111101101000111110100110001000100100001110100010010010100101000110111001111101111011111010100010010001111001101011100110111111111001011011001110110101111011111011111010011000111010011011000010010100111011111011110110100011111010011000100010010000111010001001001010010100011011100111101101011110 efa891e6b9bfe5b3b5efbe98e9b094efbda3e98890e892946e7defa891e6b9bfe5b3b5efbe98e9b094efbda3e98890e892946e7b5e
UHC ??????鈐蒔n}??????鈐蒔n{^ 00111111001111110011111100111111001111110011111111001100101000101110001111001000011011100111110100111111001111110011111100111111001111110011111111001100101000101110001111001000011011100111101101011110 3f3f3f3f3f3fcca2e3c86e7d3f3f3f3f3f3fcca2e3c86e7b5e
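
The same kind of table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the helper names (`bit_strings`, `convert`) are hypothetical, and the mapping from the charset labels above to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Encode text with codec; return (binary string, hexadecimal string)."""
    # errors="replace" substitutes b"?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {name: bit_strings(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("n{^").items():
        print(name, binary, hexadecimal)
```

Because `n`, `{` and `^` are ASCII, every charset listed here encodes them to the same bytes `6e 7b 5e`, which is why each row above ends in `6e7b5e`.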

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).