To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??靭???る??λ8???惟???k?已 111000011001111100111111001111111001000001111000001111110011111100111111100000101110100100111111001111111000001111001001100000100101011100111111001111110011111110001000110100100011111100111111001111111000001010001011001111111001101111011111 e19f3f3f90783f3f3f82e93f3f83c982573f3f3f88d23f3f3f828b3f9bdf
EUC-JP 癲??靭???る??λ8絪??惟???k?已 1110001010100001001111110011111110111111110110010011111100111111001111111010010011101011001111110011111110100110110010111010001110111000100011111101001111101100001111110011111110110000110101000011111100111111001111111010001111101011001111111101011011100001 e2a13f3fbfd93f3f3fa4eb3f3fa6cba3b88fd3ec3f3fb0d43f3f3fa3eb3fd6e1
UTF-8 癲앷쑬靭뗧땟戮る짎若λ8絪뜻첀惟겹늾力k뛼已 1110011110011001101100101110110010010101101101111110110010010001101011001110100110011101101011011110101110010111101001111110101110010101100111111110111110100111100100101110001110000010100010111110110010100111100011101110111110100101101101001100111010111011111011111011110010011000111001111011010110101010111010111001110010111011111011001011001010000000111001101000001110011111111010101011001010111001111010111000101010111110111011111010011010001010111011111011110110001011111010111001101110111100111001011011011110110010 e799b2ec95b7ec91ace99dadeb97a7eb959fefa792e3828beca78eefa5b4cebbefbc98e7b5aaeb9cbbecb280e6839feab2b9eb8abeefa68aefbd8beb9bbce5b7b2
UHC 癲앷쑬靭뗧땟戮る짎若λ8絪뜻첀惟겹늾力k뛼已 1110111110100110100111011110101010111110101010001110110011100101100010111110011110110110101011011110101110111101101010101110101110100011100110101110010110101110101001011110101110100011101110001110110011011111101101101110011010101010100011011110101011101110101100001110001110001000100001111110011010110011101000111110101110001101100000101110110010101011 efa69deabea8ece58be7b6adebbdaaeba39ae5aea5eba3b8ecdfb6e6aa8deaeeb0e38887e6b3a3eb8d82ecab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)