To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????E 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 歪????????猥??歪????????猥??E 1001100001100011001111110011111100111111001111110011111100111111001111110011111111100000110011100011111100111111100110000110001100111111001111110011111100111111001111110011111100111111001111111110000011001110001111110011111101000101 98633f3f3f3f3f3f3f3fe0ce3f3f98633f3f3f3f3f3f3f3fe0ce3f3f45
EUC-JP 歪?????洧??猥??歪?????洧??猥??E 110011111100010000111111001111110011111100111111001111111000111111000111101101000011111100111111111000001101000000111111001111111100111111000100001111110011111100111111001111110011111110001111110001111011010000111111001111111110000011010000001111110011111101000101 cfc43f3f3f3f3f8fc7b43f3fe0d03f3fcfc43f3f3f3f3f8fc7b43f3fe0d03f3f45
UTF-8 歪뺣ㅆ麟딁툞洧멥뀒猥롮퇏歪뺣ㅆ麟딁툞洧멥뀒猥롮랜E 11100110101011011010101011101011101110101010001111100011100001011000011011101111101001111011001111101011100101001000000111101101100010001001111011100110101101001010011111101011101010011010010111101011100000001001001011100111100011001010010111101011101000011010111011101101100001111000111111100110101011011010101011101011101110101010001111100011100001011000011011101111101001111011001111101011100101001000000111101101100010001001111011100110101101001010011111101011101010011010010111101011100000001001001011100111100011001010010111101011101000011010111011101011100111101001110001000101 e6adaaebbaa3e38586efa7b3eb9481ed889ee6b4a7eba9a5eb8092e78ca5eba1aeed878fe6adaaebbaa3e38586efa7b3eb9481ed889ee6b4a7eba9a5eb8092e78ca5eba1aeeb9e9c45
UHC 歪뺣ㅆ麟딁툞洧멥뀒猥롮퇏歪뺣ㅆ麟딁툞洧멥뀒猥롮랜E 11101000111000001001010111101011101001001011011011101100111010001000101011100111101110001001010111101010111110111011100011100011100001011000110011101000111001011000111011101100101101111010000011101000111000001001010111101011101001001011011011101100111010001000101011100111101110001001010111101010111110111011100011100011100001011000110011101000111001011000111011101100101101111010001101000101 e8e095eba4b6ece88ae7b895eafbb8e3858ce8e58eecb7a0e8e095eba4b6ece88ae7b895eafbb8e3858ce8e58eecb7a345

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)