To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??溢??矣??筌??源??儀??筌?Н 1110001010100011001111110011111110001000111011000011111100111111111000011110000100111111001111111110001010100011001111110011111110001100101110010011111100111111100010110101011000111111001111111110001010100011001111111000010001001110 e2a33f3f88ec3f3fe1e13f3fe2a33f3f8cb93f3f8b563f3fe2a33f844e
EUC-JP 筌??溢??矣??筌??源??儀??筌?Н 1110010010100101001111110011111110110000111011100011111100111111111000101110001100111111001111111110010010100101001111110011111110111000101110110011111100111111101101011011011100111111001111111110010010100101001111111010011110101111 e4a53f3fb0ee3f3fe2e33f3fe4a53f3fb8bb3f3fb5b73f3fe4a53fa7af
UTF-8 筌뚮뿨溢ㅿ쫶矣곗떵筌ㅻ씮源꾥돻儀뺤젘筌뗫Н 1110011110101101100011001110101110011010101011101110101110111111101010001110011010111010101000101110001110000101101111111110110010101011101101101110011110011111101000111110101010110011100101111110101110010110101101011110011110101101100011001110001110000101101110111110110010010100101011101110011010111010100100001110101010111110101001011110101110001111101110111110010110000100100000001110101110111010101001001110110010100000100110001110011110101101100011001110101110010111101010111101000010011101 e7ad8ceb9aaeebbfa8e6baa2e385bfecabb6e79fa3eab397eb96b5e7ad8ce385bbec94aee6ba90eabea5eb8fbbe58480ebbaa4eca098e7ad8ceb97abd09d
UHC 筌뚮뿨溢ㅿ쫶矣곗떵筌ㅻ씮源꾥돻儀뺤젘筌뗫Н 111011111010011110001100111010111001011110101000111011001110111010100100111011111010011010001101111010111111100010110000111011001011011010111010111011111010011110100100111010111001110110111111111010101011100110000100111010001000100110111110111010111111000010010101111011001010000010010100111011111010011110001011111010111010110010101111 efa78ceb97a8eceea4efa68debf8b0ecb6baefa7a4eb9dbfeab984e889beebf095eca094efa78bebacaf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)