What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 塋k????塋c???ィ幼????ィ???^ 1001101011001000100000101000101100111111001111110011111100111111100110101100100010000010100000110011111100111111001111111000001101000010100101110110001100111111001111110011111100111111100000110100001000111111001111110011111101011110 9ac8828b3f3f3f3f9ac882833f3f3f834297633f3f3f3f83423f3f3f5e
EUC-JP 塋k????塋c???ィ幼????ィ???^ 1101010011001010101000111110101100111111001111110011111100111111110101001100101010100011111000110011111100111111001111111010010110100011110011011100010000111111001111110011111100111111101001011010001100111111001111110011111101011110 d4caa3eb3f3f3f3fd4caa3e33f3f3fa5a3cdc43f3f3f3fa5a33f3f3f5e
UTF-8 塋k뙎溜곕젾塋c끁閱뉒ィ幼볥젾溜곁ィ栒룩룎^ 11100101101000011000101111101111101111011000101111101011100110011000111011101111101001111000101111101010101100111001010111101100101000001011111011100101101000011000101111101111101111011000001111101011100000011000000111101001100101101011000111101011100010011001001011100011100000101010001111100101101110011011110011101011101100111010010111101100101000001011111011101111101001111000101111101010101100111000000111100011100000101010001111100110101000001001001011101011101000111010100111101011101000111000111001011110 e5a18befbd8beb998eefa78beab395eca0bee5a18befbd83eb8181e996b1eb8992e382a3e5b9bcebb3a5eca0beefa78beab381e382a3e6a092eba3a9eba38e5e
UHC 塋k뙎溜곕젾塋c끁閱뉒ィ幼볥젾溜곁ィ栒룩룎^ 11100111101010111010001111101011100011001001001111101010111111101011000011101011101000001011000011100111101010111010001111100011100001011011011111100110111100111000011111100111101010111010001111101010111010101001001111101011101000001011000011101010111111101011000011100111101010111010001111100010111000111011011111101000100011111000110001011110 e7aba3eb8c93eafeb0eba0b0e7aba3e385b7e6f387e7aba3eaea93eba0b0eafeb0e7aba3e2e3b7e88f8c5e

SJIS-win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
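
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name to_bit_strings and the CODECS mapping are hypothetical, and the mapping from the charset labels used here to Python codec names (SJIS-win to cp932, UHC to cp949) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which resembles the runs of 3f in the table above.

```python
# Assumed mapping from the charset labels on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec.

    Unencodable characters become '?' instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "あ^"
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Each output line follows the same column order as the table: charset, character, binary bit string, hexadecimal bit string.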