To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣℡?轅⑤?艶j?維??癒??沃 00111111001111110011111110001011100000111000011110000100001111111110011101110110100001110100010000111111100010011001000010000010100010100011111110001000110110110011111100111111100101101111110000111111001111111001011110000000 3f3f3f8b8387843fe77687443f8990828a3f88db3f3f96fc3f3f9780
EUC-JP ???泣??轅??艶j?維??癒??沃 0011111100111111001111111011010111100011001111110011111111101101110101110011111100111111101100011111000010100011111010100011111110110000110111010011111100111111110011001111111000111111001111111100110111100000 3f3f3fb5e33f3fedd73f3fb1f0a3ea3fb0dd3f3fccfe3f3fcde0
UTF-8 捻꿔꺂泣℡칰轅⑤뎠艶j퍗維쏁춯癒뀄맋沃 111011111010011010100100111010101011111110010100111010101011101010000010111001101011001110100011111000101000010010100001111011001011100110110000111010001011110110000101111000101001000110100100111010111000111010100000111010001000100110110110111011111011110110001010111011011000110110010111111001111011011010101101111011001000111110000001111011001011011010101111111001111001100110010010111010111000000010000100111010111010011110001011111001101011001010000011 efa6a4eabf94eaba82e6b3a3e284a1ecb9b0e8bd85e291a4eb8ea0e889b6efbd8aed8d97e7b6adec8f81ecb6afe79992eb8084eba78be6b283
UHC 捻꿔꺂泣℡칰轅⑤뎠艶j퍗維쏁춯癒뀄맋沃 1110011011110111101100101110001110000011101010111110101111101000101000101110010110101111100000111110101010111111101010001110101110110101101100011110011011111101101000111110101010111011100011101110101110101011100110111110011110101101100011001110101110101000101100101110110110010000101000111110100010101010 e6f7b2e383abebe8a2e5af83eabfa8ebb5b1e6fda3eabb8eebab9be7ad8ceba8b2ed90a3e8aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)