To what bit string is a character (or a short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蘖?????飮??癰??擬??熬??吟?? 100111110101000000111111001111110011111100111111001111111001111101011010001111110011111111100001100111100011111100111111100010110101101100111111001111111110000010010010001111110011111110001011111000010011111100111111 9f503f3f3f3f3f9f5a3f3fe19e3f3f8b5b3f3fe0923f3f8be13f3f
EUC-JP 蘖?????飮??癰??擬??熬??吟?? 110111011011000100111111001111110011111100111111001111111101110110111011001111110011111111100001111111100011111100111111101101011011110000111111001111111101111111110010001111110011111110110110111000110011111100111111 ddb13f3f3f3f3fddbb3f3fe1fe3f3fb5bc3f3fdff23f3fb6e33f3f
UTF-8 蘖뽮퉭栒뤻씣飮뗭춳癰귘뼠擬쀫눢熬곣뫀吟띸춯 111010001001100010010110111010111011110110101110111011011000100110101101111001101010000010010010111010111010010010111011111011001001010010100011111010011010001110101110111010111001011110101101111011001011011010110011111001111001100110110000111010101011011110011000111010111011110010100000111001101001001110101100111011001000000010101011111010111000100010100010111001111000011010101100111010101011001110100011111010111010101110000000111001011001000010011111111010111001110110111000111011001011011010101111 e89896ebbdaeed89ade6a092eba4bbec94a3e9a3aeeb97adecb6b3e799b0eab798ebbca0e693acec80abeb88a2e786aceab3a3ebab80e5909feb9db8ecb6af
UHC 蘖뽮퉭栒뤻씣飮뗭춳癰귘뼠擬쀫눢熬곣뫀吟띸춯 111001011110111010010110111010101011100110000101111000101110001110001111111010011001110110110111111010111110011010001011111011001010110110001111111010001011100110000010111000101001011010100011111010111111010010010111111010111000011110111001111010001010001010000001111000101001000110100100111010111110000110001101111001111010110110001100 e5ee96eab985e2e38fe99db7ebe68becad8fe8b982e296a3ebf497eb87b9e8a281e291a4ebe18de7ad8c

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
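
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation: the function names `to_bit_strings` and `convert` are hypothetical, and the mapping of table labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table above does.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary string, hex string)} for every charset."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

For example, `convert("あ")["UTF-8"]` yields the hexadecimal string `e38182`, while ISO-8859-1, which has no Japanese characters, yields `3f`.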