To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 勇?????應??熬???∽?儀??甕 10010111010001010011111100111111001111110011111100111111100111001110010000111111001111111110000010010010001111110011111100111111100000011110010000111111100010110101011000111111001111111110000101010000 97453f3f3f3f3f9ce43f3fe0923f3f3f81e43f8b563f3fe150
EUC-JP 勇??佾??應??熬??堉∽?儀??甕 1100110110100110001111110011111110001111101100001111101100111111001111111101100011100110001111110011111111011111111100100011111100111111100011111011011111111101101000101110011000111111101101011011011100111111001111111110000110110001 cda63f3f8fb0fb3f3fd8e63f3fdff23f3f8fb7fda2e63fb5b73f3fe1b1
UTF-8 勇싳뮄佾믤쨫應뀄뫛熬곻퐢堉∽쬉儀볧뜑甕 111001011000101110000111111011001000101110110011111010111010111010000100111001001011110110111110111010111010111110100100111011001010100010101011111001101000011110001001111010111000000010000100111010111010101110011011111001111000011010101100111010101011001110111011111011011001000010100010111001011010000010001001111000101000100010111101111011001010110010001001111001011000010010000000111010111011001110100111111010111001110010010001111001111001010010010101 e58b87ec8bb3ebae84e4bdbeebafa4eca8abe68789eb8084ebab9be786aceab3bbed90a2e5a089e288bdecac89e58480ebb3a7eb9c91e79495
UHC 勇싳뮄佾믤쨫應뀄뫛熬곻퐢堉∽쬉儀볧뜑甕 1110100110111000100110101110110010010010100100111110110011101011100100101110011010100100100001011110101111101011101100101110110110010001101110111110100010100010100000011110111110111101100010111110101110111100101000011110111110100110100111111110101111110000100100111110110110001101100101001110100010111000 e9b89aec9293eceb92e6a485ebebb2ed91bbe8a281efbd8bebbca1efa69febf093ed8d94e8b8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)