To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????B 00111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f42
SJIS-WIN 巽側尊巽側尊B 10010010010001101001000110100100100100011011100010010010010001101001000110100100100100011011100001000010 924691a491b8924691a491b842
EUC-JP 巽側尊巽側尊B 11000011101001111100001010100110110000101011101011000011101001111100001010100110110000101011101001000010 c3a7c2a6c2bac3a7c2a6c2ba42
UTF-8 巽側尊巽側尊B 11100101101101111011110111100101100000011011010011100101101100001000101011100101101101111011110111100101100000011011010011100101101100001000101001000010 e5b7bde581b4e5b08ae5b7bde581b4e5b08a42
UHC 巽側尊巽側尊B 11100001110111101111011010110000111100001110111011100001110111101111011010110000111100001110111001000010 e1def6b0f0eee1def6b0f0ee42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)