To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 筌λ?嚥?????[筌λ?嚥?????[^ 111000101010001110000011110010010011111110011010100010110011111100111111001111110011111100111111010110111110001010100011100000111100100100111111100110101000101100111111001111110011111100111111001111110101101101011110 e2a383c93f9a8b3f3f3f3f3f5be2a383c93f9a8b3f3f3f3f3f5b5e
EUC-JP 筌λ?嚥?????[筌λ?嚥?????[^ 111001001010010110100110110010110011111111010011111010110011111100111111001111110011111100111111010110111110010010100101101001101100101100111111110100111110101100111111001111110011111100111111001111110101101101011110 e4a5a6cb3fd3eb3f3f3f3f3f5be4a5a6cb3fd3eb3f3f3f3f3f5b5e
UTF-8 筌λ챶嚥깍㎘紐껅엽[筌λ챶嚥깍㎘紐껅엽[^ 11100111101011011000110011001110101110111110110010110001101101101110010110011010101001011110101010111001100011011110001110001110100110001110111110100111100011111110101010111011100001011110110010010111101111010101101111100111101011011000110011001110101110111110110010110001101101101110010110011010101001011110101010111001100011011110001110001110100110001110111110100111100011111110101010111011100001011110110010010111101111010101101101011110 e7ad8ccebbecb1b6e59aa5eab98de38e98efa78feabb85ec97bd5be7ad8ccebbecb1b6e59aa5eab98de38e98efa78feabb85ec97bd5b5e
UHC 筌λ챶嚥깍㎘紐껅엽[筌λ챶嚥깍㎘紐껅엽[^ 111011111010011110100101111010111010101010000011111001101011111110110001111011111010011110100101111010111010101010000011111001101011111110110001010110111110111110100111101001011110101110101010100000111110011010111111101100011110111110100111101001011110101110101010100000111110011010111111101100010101101101011110 efa7a5ebaa83e6bfb1efa7a5ebaa83e6bfb15befa7a5ebaa83e6bfb1efa7a5ebaa83e6bfb15b5e
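The kind of conversion shown in the table can be sketched in Python. This is a minimal illustration, not the tool's actual implementation: the function name `encode_to_bits` and the `CHARSETS` mapping are hypothetical, the Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumed equivalents, and unencodable characters are assumed to be replaced with `?` (0x3f), as the ISO-8859-1 row suggests. Results for some characters may differ from the tool's if its conversion tables differ from Python's.

```python
from typing import Tuple

# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text: str, codec: str) -> Tuple[str, str]:
    """Encode text with the given codec and return (binary string, hex string).

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "あ[^"  # sample input: one Japanese character plus two ASCII characters
    for label, codec in CHARSETS.items():
        bits, hexa = encode_to_bits(sample, codec)
        print(label, bits, hexa)
```

For ASCII-only input such as `[^`, every charset listed here yields the same bytes (`5b5e`), which matches the trailing bytes of each row in the table.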

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).