To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 淨?????程???? 10011111110001000011111100111111001111110011111100111111100100101111011000111111001111110011111100111111 9fc43f3f3f3f3f92f63f3f3f3f
EUC-JP 淨???焌?程?釪?? 1101111011000110001111110011111100111111100011111100100111101000001111111100010011111000001111111000111111100011101011010011111100111111 dec63f3f3f8fc9e83fc4f83f8fe3ad3f3f
UTF-8 淨렞渽렜焌렠程렣釪당밞 111001101011011110101000111010111010000010011110111001101011100010111101111010111010000010011100111001111000010010001100111010111010000010100000111001111010100010001011111010111010000010100011111010011000011110101010111010111000101110111001111010111011000010011110 e6b7a8eba09ee6b8bdeba09ce7848ceba0a0e7a88beba0a3e987aaeb8bb9ebb09e
UHC 淨렞渽렜焌렠程렣釪당밞 11101111111001001000111010101111111011101010101010001110101011101111000111100000100011101011000111101111111011111000111010110100111010011110100110110100111001111011100111100001 efe48eafeeaa8eaef1e08eb1efef8eb4e9e9b4e7b9e1

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table.
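
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation: the mapping from the charset labels to Python codec names (cp932 for SJIS-Win, euc_jp for EUC-JP, cp949 for UHC) and the helper names encode_row and encode_table are assumptions made for illustration. Python's codec tables may also differ from the page's converter for some characters, so individual rows are not guaranteed to match.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("程").items():
        print(label, bits, hexstr)
```

Passing errors="replace" is what produces the runs of 3f in the table: each unencodable character becomes a single "?" byte instead of raising an error.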