What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 葉??艶e????檍??宜??狎る????泣 100101110111010000111111001111111000100110010000100000101000010100111111001111110011111100111111100111101111100000111111001111111000101101011000001111110011111111100000101111101000001011101001001111110011111100111111001111111000101110000011 97743f3f899082853f3f3f3f9ef83f3f8b583f3fe0be82e93f3f3f3f8b83
EUC-JP 葉??艶e????檍??宜??狎る????泣 110011011101010100111111001111111011000111110000101000111110010100111111001111110011111100111111110111001111101000111111001111111011010110111001001111110011111111100000110000001010010011101011001111110011111100111111001111111011010111100011 cdd53f3fb1f0a3e53f3f3f3fdcfa3f3fb5b93f3fe0c0a4eb3f3f3f3fb5e3
UTF-8 葉뗫젫艶e넪溜곕젾檍됱뼐宜븀윢狎る젾溜경뇡泣 111010001001000110001001111010111001011110101011111011001010000010101011111010001000100110110110111011111011110110000101111010111000010010101010111011111010011110001011111010101011001110010101111011001010000010111110111001101010101010001101111010111001000010110001111010111011110010010000111001011010111010011100111010111011100010000000111011001001110010100010111001111000101110001110111000111000001010001011111011001010000010111110111011111010011110001011111010101011001010111101111010111000011110100001111001101011001110100011 e89189eb97abeca0abe889b6efbd85eb84aaefa78beab395eca0bee6aa8deb90b1ebbc90e5ae9cebb880ec9ca2e78b8ee3828beca0beefa78beab2bdeb87a1e6b3a3
UHC 葉뗫젫艶e넪溜곕젾檍됱뼐宜븀윢狎る젾溜경뇡泣 1110011110101000100010111110101110100000101000111110011011111101101000111110010110000110101010101110101011111110101100001110101110100000101100001110010111100101100010011110110010010110100110001110101111110001101110101110011110011111101000111110010011100100101010101110101110100000101100001110101011111110101100001110011010000111100010011110101111101000 e7a88beba0a3e6fda3e586aaeafeb0eba0b0e5e589ec9698ebf1bae79fa3e4e4aaeba0b0eafeb0e68789ebe8

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
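
A conversion like the one in the table above can be approximated offline. The following is a minimal sketch, not the code behind this page: it assumes that Python's standard codecs `latin-1`, `cp932`, `euc_jp`, `utf-8` and `cp949` correspond to the charsets listed, and that characters a charset cannot represent are replaced with "?" (byte 3f), as the table rows suggest. The names `CHARSETS` and `encode_table` are hypothetical.

```python
# Mapping from the charset labels in the table to Python codec names.
# This mapping is an assumption; the page does not say which converters it uses.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return one (charset, shown_text, binary, hex) row per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        # Decode again to see what survives the round trip.
        shown = raw.decode(codec)
        # Render every byte as 8 binary digits, then as 2 hex digits.
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, shown, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, binary, hexstr in encode_table("葉"):
        print(label, shown, binary, hexstr)
```

For the single character 葉, this gives `3f` for ISO-8859-1, `9774` for SJIS-WIN, `cdd5` for EUC-JP and `e89189` for UTF-8, which matches the leading bytes of the corresponding rows above.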