To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 疾杓シクシノシエ[疾杓シクシノシエ[^ 100011101011111010001110110110111011110010111000111100001100010110111100110010011011110010110100010110111000111010111110100011101101101110111100101110001111000011000101101111001100100110111100101101000101101101011110 8ebe8edbbcb8f0c5bcc9bcb45b8ebe8edbbcb8f0c5bcc9bcb45b5e
EUC-JP 疾杓シク?シノシエ[疾杓シク?シノシエ[^ 10111100110000001011110011011101100011101011110010001110101110000011111110001110101111001000111011001001100011101011110010001110101101000101101110111100110000001011110011011101100011101011110010001110101110000011111110001110101111001000111011001001100011101011110010001110101101000101101101011110 bcc0bcdd8ebc8eb83f8ebc8ec98ebc8eb45bbcc0bcdd8ebc8eb83f8ebc8ec98ebc8eb45b5e
UTF-8 疾杓シクシノシエ[疾杓シクシノシエ[^ 111001111001011010111110111001101001110110010011111011111011110110111100111011111011110110111000111011101000001010000100111011111011110110111100111011111011111010001001111011111011110110111100111011111011110110110100010110111110011110010110101111101110011010011101100100111110111110111101101111001110111110111101101110001110111010000010100001001110111110111101101111001110111110111110100010011110111110111101101111001110111110111101101101000101101101011110 e796bee69d93efbdbcefbdb8ee8284efbdbcefbe89efbdbcefbdb45be796bee69d93efbdbcefbdb8ee8284efbdbcefbe89efbdbcefbdb45b5e
UHC 疾杓???????[疾杓???????[^ 11110010111100001111100011110101001111110011111100111111001111110011111100111111001111110101101111110010111100001111100011110101001111110011111100111111001111110011111100111111001111110101101101011110 f2f0f8f53f3f3f3f3f3f3f5bf2f0f8f53f3f3f3f3f3f3f5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)