To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鴨??宜?????液??宥??暎?????^ 100010101001101100111111001111111000101101011000001111110011111100111111001111110011111110001001011101000011111100111111100101110100011100111111001111111001110111110011001111110011111100111111001111110011111101011110 8a9b3f3f8b583f3f3f3f3f89743f3f97473f3f9df33f3f3f3f3f5e
EUC-JP 鴨??宜?????液??宥??暎?????^ 101100111111101100111111001111111011010110111001001111110011111100111111001111110011111110110001110101010011111100111111110011011010100000111111001111111101101011110101001111110011111100111111001111110011111101011110 b3fb3f3fb5b93f3f3f3f3fb1d53f3fcda83f3fdaf53f3f3f3f3f5e
UTF-8 鴨뗫떦宜긺꺌溜경옗液ㅵ뇣宥뱀벑暎㏓젾女앶턄^ 11101001101101001010100011101011100101111010101111101011100101101010011011100101101011101001110011101010101110001011101011101010101110101000110011101111101001111000101111101010101100101011110111101100100110001001011111100110101101101011001011100011100001011011010111101011100001111010001111100101101011101010010111101011101100011000000011101011101100101001000111100110100110101000111011100011100011111001001111101100101000001011111011101111101001101000000111101100100101011011011011101101100001001000010001011110 e9b4a8eb97abeb96a6e5ae9ceab8baeaba8cefa78beab2bdec9897e6b6b2e385b5eb87a3e5aea5ebb180ebb291e69a8ee38f93eca0beefa681ec95b6ed84845e
UHC 鴨뗫떦宜긺꺌溜경옗液ㅵ뇣宥뱀벑暎㏓젾女앶턄^ 11100100111001011000101111101011100010111011100111101011111100011011000111100111101100101010011111101010111111101011000011100110100111101001110111100100111110111010010011100101100001111000101111101010111010011011100111101100100100111011000111100111101100101010011111101011101000001011000011100101111111001001110111101001101101011010000001011110 e4e58beb8bb9ebf1b1e7b2a7eafeb0e69e9de4fba4e5878beae9b9ec93b1e7b2a7eba0b0e5fc9de9b5a05e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)