To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 蒸?迂狡¬?? 1000111111110110001111111000100101001001111000001100001010000001110010100011111100111111 8ff63f8949e0c281ca3f3f
EUC-JP 蒸?迂狡¬?? 1011111011111000001111111011000110101010111000001100010010100010110011000011111100111111 bef83fb1aae0c4a2cc3f3f
UTF-8 蒸렣迂狡¬諪렍 111010001001001010111000111010111010000010100011111010001011111110000010111001111000101110100001111011111011111110100010111010001010101110101010111010111010000010001101 e892b8eba0a3e8bf82e78ba1efbfa2e8abaaeba08d
UHC 蒸렣迂狡¬諪렍 1111000111111010100011101011010011101001111001101100111011101010101000011111111011101111111101011000111010100011 f1fa8eb4e9e6ceeaa1feeff58ea3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)