What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ??????? | 00111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f
SJIS-WIN | 佃???提俸? | 10010010110011110011111100111111001111111001001011110001100101011110111000111111 | 92cf3f3f3f92f195ee3f
EUC-JP | 佃???提俸? | 11000100110100010011111100111111001111111100010011110011110010101111000000111111 | c4d13f3f3fc4f3caf03f
UTF-8 | 佃꿰렍렖提俸황 | 111001001011110110000011111010101011111110110000111010111010000010001101111010111010000010010110111001101000111110010000111001001011111110111000111011011001100110101001 | e4bd83eabfb0eba08deba096e68f90e4bfb8ed99a9
UHC | 佃꿰렍렖提俸황 | 1110111011101100101100101110011110001110101000111000111010101011111100001010101111011100111001001100100010110010 | eeecb2e78ea38eabf0abdce4c8b2

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
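
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the tool's own implementation. The function name `encode_row` and the `CODECS` mapping are hypothetical, and the choice of Python codec names for each label (for example `cp932` for SJIS-WIN and `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which is the behavior the table appears to show.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (text as the charset can represent it, bit string, hex string)."""
    # errors="replace" turns every unencodable character into b"?" (0x3f).
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return data.decode(codec), bits, data.hex()


if __name__ == "__main__":
    # The seven-character sample from the table, written as escapes.
    sample = "\u4f43\uaff0\ub80d\ub816\u63d0\u4ff8\ud669"
    for label, codec in CODECS.items():
        shown, bits, hexed = encode_row(sample, codec)
        print(label, shown, bits, hexed)
```

Each byte is rendered as exactly eight binary digits, so the bit string is always four times as long as the hexadecimal string.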