To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 鈬エ辜狗羔辜毅 11100111111000111011010011100111100000111000101111100111111000111011010011100111100000111000101101000010 e7e3b4e7838be7e3b4e7838b42
EUC-JP 鈬エ辜狗羔辜毅 1110111011100101100011101011010011101101111000111011011011101001111001101011011011101101111000111011010110100011 eee58eb4ede3b6e9e6b6ede3b5a3
UTF-8 鈬エ辜狗羔辜毅 111010011000100010101100111011111011110110110100111010001011111010011100111001111000101110010111111001111011111010010100111010001011111010011100111001101010111110000101 e988acefbdb4e8be9ce78b97e7be94e8be9ce6af85
UHC ??辜狗羔辜毅 001111110011111111001101110100001100111110110111110011011100010011001101110100001110101111110110 3f3fcdd0cfb7cdc4cdd0ebf6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)