To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 燿??竊??榮??堰??抑??傲??節??^ 1110000010100000001111110011111111100010100001100011111100111111100111101100010000111111001111111000100110000001001111110011111110010111011111010011111100111111100110001111110000111111001111111001000011011111001111110011111101011110 e0a03f3fe2863f3f9ec43f3f89813f3f977d3f3f98fc3f3f90df3f3f5e
EUC-JP 燿??竊??榮??堰??抑??傲??節??^ 1110000010100010001111110011111111100011111001100011111100111111110111001100011000111111001111111011000111100001001111110011111111001101110111100011111100111111110100001111111000111111001111111100000011100001001111110011111101011110 e0a23f3fe3e63f3fdcc63f3fb1e13f3fcdde3f3fd0fe3f3fc0e13f3f5e
UTF-8 燿쒏녂竊껇럦榮뗰슴堰묌옖抑븝펱傲됬큾節욥쨰^ 11100111100001111011111111101100100100101000111111101011100001011000001011100111101010111000101011101010101110111000011111101011100111111010011011100110101001101010111011101011100101111011000011101100100010101011010011100101101000001011000011101011101011001000110011101100100110001001011011100110100010101001000111101011101110001001110111101101100011101011000111100101100000101011001011101011100100001010110011101101100000011011111011100111101011111000000011101100100110101010010111101100101010001011000001011110 e787bfec928feb8582e7ab8aeabb87eb9fa6e6a6aeeb97b0ec8ab4e5a0b0ebac8cec9896e68a91ebb89ded8eb1e582b2eb90aced81bee7af80ec9aa5eca8b05e
UHC 燿쒏녂竊껇럦榮뗰슴堰묌옖抑븝펱傲됬큾節욥쨰^ 11101000111111001001110011100110100001101011101011101111101111001000001111101000100011101000100111100111101101001000101111101111101111011011111111100101111010001001000111101001100111101001110011100101111001001011101011101111101111001000001111100111111011001000100111100111101101001000101111101111101111011011111111101001101001001000101001011110 e8fc9ce686baefbc83e88e89e7b48befbdbfe5e891e99e9ce5e4baefbc83e7ec89e7b48befbdbfe9a48a5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)