To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 乳℡?楯??? 10010011111110111000011110000100001111111000111101111100001111110011111100111111 93fb87843f8f7c3f3f3f
EUC-JP 乳??楯??? 110001101111110100111111001111111011110111011101001111110011111100111111 c6fd3f3fbddd3f3f3f
UTF-8 乳℡썬楯폈렒렦 111001001011100110110011111000101000010010100001111011001000110110101100111001101010010110101111111011011000111110001000111010111010000010010010111010111010000010100110 e4b9b3e284a1ec8dace6a5afed8f88eba092eba0a6
UHC 乳℡썬楯폈렒렦 1110101011100001101000101110010110111101111000111110001011100100110001101111000110001110101001111000111010110101 eae1a2e5bde3e2e4c6f18ea78eb5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)