What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 藥??岳??秧??鳶??央?塋ゅⅹ 11100101010110100011111100111111100010100111100000111111001111111110001001011110001111110011111110010011110011100011111100111111100010011001101100111111100110101100100010000010111000111111101001001001 e55a3f3f8a783f3fe25e3f3f93ce3f3f899b3f9ac882e3fa49
EUC-JP 藥??岳??秧??鳶??央?塋ゅ? 111010011011101100111111001111111011001111011001001111110011111111100011101111110011111100111111110001101101000000111111001111111011000111111011001111111101010011001010101001001110010100111111 e9bb3f3fb3d93f3fe3bf3f3fc6d03f3fb1fb3fd4caa4e53f
UTF-8 藥썸렋岳껇갬秧녑뜵鳶멩뜆央놩塋ゅⅹ 111010001001011110100101111011001000110110111000111010111010000010001011111001011011001010110011111010101011101110000111111010101011000010101100111001111010011110100111111010111000010110010001111010111001110010110101111010011011001110110110111010111010100110101001111010111001110010000110111001011010010010101110111010111000011010101001111001011010000110001011111000111000001010000101111000101000010110111001 e897a5ec8db8eba08be5b2b3eabb87eab0ace7a7a7eb8591eb9cb5e9b3b6eba9a9eb9c86e5a4aeeb86a9e5a18be38285e285b9
UHC 藥썸렋岳껇갬秧녑뜵鳶멩뜆央놩塋ゅⅹ 11100101101101111011110111100110100011101010001011100100101111111000001111101000101100001011011111100100111010111011001111100101100011011011001111100110111010011011100011100110100011011000100111100100111001111000011101001011111001111010101110101010111001011010010110101010 e5b7bde68ea2e4bf83e8b0b7e4ebb3e58db3e6e9b8e68d89e4e7874be7abaae5a5aa

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table above.
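
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. It assumes that the Python codec names `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the charsets listed above, and that unencodable characters are replaced with "?" (0x3f), as the ISO-8859-1 row suggests. The names `CHARSETS`, `encode_to_bits`, and `convert` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hexadecimal string) for text encoded with codec."""
    # errors="replace" substitutes '?' (0x3f) for characters the codec
    # cannot represent, instead of raising UnicodeEncodeError.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_to_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in convert("あ").items():
        print(label, bits, hexstr)
```

Passing the text as a Python `str` means the input is already decoded Unicode. If the input bytes are decoded with the wrong charset before this step, the output will show mojibake similar to the "Character" column above.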