To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????碩????益 001111110011111100111111001111111001000011010111001111110011111100111111001111111000100101110110 3f3f3f3f90d73f3f3f3f8976
EUC-JP ????碩????益 001111110011111100111111001111111100000011011001001111110011111100111111001111111011000111010111 3f3f3f3fc0d93f3f3f3fb1d7
UTF-8 溺솜臨셀碩溺솜鱗셜益 111011111010011110101100111011001000011010011100111011111010011110110110111011001000010110000000111001111010001010101001111011111010011110101100111011001000011010011100111011111010011110110010111011001000010110011100111001111001101110001010 efa7acec869cefa7b6ec8580e7a2a9efa7acec869cefa7b2ec859ce79b8a
UHC 溺솜臨셀碩溺솜鱗셜益 1110110011001010101111001101100011101100111110101011110010111111111000001011010111101100110010101011110011011000111011001110011110111100110010001110110011001100 eccabcd8ecfabcbfe0b5eccabcd8ece7bcc8eccc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)