What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??????葉??抑????????葉??抑??^ 0011111100111111001111110011111100111111001111111001011101110100001111110011111110010111011111010011111100111111001111110011111100111111001111110011111100111111100101110111010000111111001111111001011101111101001111110011111101011110 3f3f3f3f3f3f97743f3f977d3f3f3f3f3f3f3f3f97743f3f977d3f3f5e
EUC-JP ??????葉??抑????????葉??抑??^ 0011111100111111001111110011111100111111001111111100110111010101001111110011111111001101110111100011111100111111001111110011111100111111001111110011111100111111110011011101010100111111001111111100110111011110001111110011111101011110 3f3f3f3f3f3fcdd53f3fcdde3f3f3f3f3f3f3f3fcdd53f3fcdde3f3f5e
UTF-8 銳랁룉溜곕젾葉뗫젚抑띠뙣銳랁룉溜곕젾葉뗫젚抑띠뙟^ 11101001100010101011001111101011100111101000000111101011101000111000100111101111101001111000101111101010101100111001010111101100101000001011111011101000100100011000100111101011100101111010101111101100101000001001101011100110100010101001000111101011100111011010000011101011100110011010001111101001100010101011001111101011100111101000000111101011101000111000100111101111101001111000101111101010101100111001010111101100101000001011111011101000100100011000100111101011100101111010101111101100101000001001101011100110100010101001000111101011100111011010000011101011100110011001111101011110 e98ab3eb9e81eba389efa78beab395eca0bee89189eb97abeca09ae68a91eb9da0eb99a3e98ab3eb9e81eba389efa78beab395eca0bee89189eb97abeca09ae68a91eb9da0eb999f5e
UHC 銳랁룉溜곕젾葉뗫젚抑띠뙣銳랁룉溜곕젾葉뗫젚抑띠뙟^ 11100111111001011000110111101101100011111000100011101010111111101011000011101011101000001011000011100111101010001000101111101011101000001001011011100101111001001011011011101100100011001010100011100111111001011000110111101101100011111000100011101010111111101011000011101011101000001011000011100111101010001000101111101011101000001001011011100101111001001011011011101100100011001010010001011110 e7e58ded8f88eafeb0eba0b0e7a88beba096e5e4b6ec8ca8e7e58ded8f88eafeb0eba0b0e7a88beba096e5e4b6ec8ca45e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
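
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the helper name to_bits is hypothetical, the codec names are Python's own (cp932 for SJIS-WIN, cp949 for UHC), and replacing unencodable characters with "?" is an assumption that happens to match the 3f bytes in the ISO-8859-1 row.

```python
# Map the charset labels used in the table to Python codec names.
# cp932 for SJIS-WIN follows the note above; cp949 for UHC is an assumption.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Print one row per charset for a sample input.
sample = "葉^"
for label, codec in CHARSETS.items():
    binary, hexadecimal = to_bits(sample, codec)
    print(label, sample, binary, hexadecimal)
```

For the character 葉 this gives e89189 in UTF-8, 9774 in SJIS-WIN and cdd5 in EUC-JP, which are the same byte sequences that appear in the rows above, while ISO-8859-1 falls back to 3f.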