Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 兆?樟企??? 10010010100110110011111110001111101111101000101011101001001111110011111100111111 929b3f8fbe8ae93f3f3f
EUC-JP 兆?樟企??? 11000011111110110011111110111110110000001011010011101011001111110011111100111111 c3fb3fbec0b4eb3f3f3f
UTF-8 兆렗樟企렠쇤깡 111001011000010110000110111010111010000010010111111001101010100010011111111001001011110010000001111010111010000010100000111011001000011110100100111010101011100110100001 e58586eba097e6a89fe4bc81eba0a0ec87a4eab9a1
UHC 兆렗樟企렠쇤깡 1111000010111100100011101010110011101101111010011101000011101010100011101011000110111100111010011011000111111000 f0bc8eacede9d0ea8eb1bce9b1f8
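
The table above can be approximated offline. The following is a minimal sketch, not the tool's actual implementation: the Python codec names (`latin-1`, `cp932`, `euc_jp`, `cp949`) are assumed equivalents of the charset labels in the table, and the helper names `to_bits` and `encode_all` are hypothetical. Characters a charset cannot represent are replaced with "?" (byte 3f), which matches the 3f bytes seen in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

# The seven-character sample string from the table, written as escapes.
SAMPLE = "\u5146\ub817\u6a1f\u4f01\ub820\uc1e4\uae61"


def to_bits(data: bytes) -> str:
    """Render each byte as eight binary digits, concatenated."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_all(text: str) -> dict:
    """Encode text in every charset; unencodable characters become '?'."""
    rows = {}
    for label, codec in CODECS.items():
        raw = text.encode(codec, errors="replace")
        rows[label] = {
            "text": raw.decode(codec),  # what survives the round trip
            "bits": to_bits(raw),
            "hex": raw.hex(),
        }
    return rows


if __name__ == "__main__":
    for label, row in encode_all(SAMPLE).items():
        print(label, row["bits"], row["hex"])
```

Using `errors="replace"` rather than the default strict mode is a deliberate choice here: strict mode would raise `UnicodeEncodeError` on the first unrepresentable character instead of producing a row.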

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, SJIS-Win (= CP932) on Windows and EUC-JP on UNIX.