What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???乙??害??^ 001111110011111100111111100010011011001100111111001111111000101001010001001111110011111101011110 3f3f3f89b33f3f8a513f3f5e
EUC-JP ???乙??害??^ 001111110011111100111111101100101011010100111111001111111011001110110010001111110011111101011110 3f3f3fb2b53f3fb3b23f3f5e
UTF-8 蓮웜첌乙당눨害숈궛^ 11101111101001101001100111101100100110111001110011101100101100101000110011100100101110011001100111101011100010111011100111101011100010001010100011100101101011101011001111101100100010001000100011101010101101101001101101011110 efa699ec9b9cecb28ce4b999eb8bb9eb88a8e5aeb3ec8888eab69b5e
UHC 蓮웜첌乙당눨害숈궛^ 11100110111001011011111111111010101010101001100111101011111000001011010011100111100001111011111111111010101010101001100111101100100000101011000001011110 e6e5bffaaa99ebe0b4e787bffaaa99ec82b05e

SJIS-Win, EUC-JP: Classic character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
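
A table like the one above can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the function name `encode_table` and the mapping of charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions, and Python's codecs may differ from this tool's converter on a handful of characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the rows shown above.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text, charsets=CHARSETS):
    """Return one (label, shown_text, binary, hex) row per charset."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        # Decode again so the row shows what the charset actually kept.
        shown = data.decode(codec)
        # Eight binary digits per encoded byte, concatenated.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, shown, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexa in encode_table("乙^"):
        print(label, shown, bits, hexa)
```

For the input "乙^", the SJIS-WIN row yields the hex string 89b35e and the EUC-JP row yields b2b55e, the same byte values for 乙 and ^ that appear in the table above.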