To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 要??竊??榮??堰??抑??傲??節??^ 1001011101110110001111110011111111100010100001100011111100111111100111101100010000111111001111111000100110000001001111110011111110010111011111010011111100111111100110001111110000111111001111111001000011011111001111110011111101011110 97763f3fe2863f3f9ec43f3f89813f3f977d3f3f98fc3f3f90df3f3f5e
EUC-JP 要??竊??榮??堰??抑??傲??節??^ 1100110111010111001111110011111111100011111001100011111100111111110111001100011000111111001111111011000111100001001111110011111111001101110111100011111100111111110100001111111000111111001111111100000011100001001111110011111101011110 cdd73f3fe3e63f3fdcc63f3fb1e13f3fcdde3f3fd0fe3f3fc0e13f3f5e
UTF-8 要뺞녂竊껇럦榮뗰슴堰묌찕抑븝펱傲됬큾節욥쨰^ 11101000101001101000000111101011101110101001111011101011100001011000001011100111101010111000101011101010101110111000011111101011100111111010011011100110101001101010111011101011100101111011000011101100100010101011010011100101101000001011000011101011101011001000110011101100101100001001010111100110100010101001000111101011101110001001110111101101100011101011000111100101100000101011001011101011100100001010110011101101100000011011111011100111101011111000000011101100100110101010010111101100101010001011000001011110 e8a681ebba9eeb8582e7ab8aeabb87eb9fa6e6a6aeeb97b0ec8ab4e5a0b0ebac8cecb095e68a91ebb89ded8eb1e582b2eb90aced81bee7af80ec9aa5eca8b05e
UHC 要뺞녂竊껇럦榮뗰슴堰묌찕抑븝펱傲됬큾節욥쨰^ 11101001101010011001010111100110100001101011101011101111101111001000001111101000100011101000100111100111101101001000101111101111101111011011111111100101111010001001000111101001101010011001010111100101111001001011101011101111101111001000001111100111111011001000100111100111101101001000101111101111101111011011111111101001101001001000101001011110 e9a995e686baefbc83e88e89e7b48befbdbfe5e891e9a995e5e4baefbc83e7ec89e7b48befbdbfe9a48a5e

SJIS-Win, EUC-JP: Classic character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
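
The conversion shown in the table can be approximated offline with a few lines of Python. This is a minimal sketch, not the code behind this page. The names `CHARSETS`, `to_bit_strings`, and `convert` are hypothetical, and the mapping from the table's labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to match the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("要^").items():
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("^", "utf-8")` gives the bit string `01011110` and the hexadecimal string `5e`, the same trailing byte that every row of the table ends with.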