What bit string is a character encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???烏l????異???ヨ。怡??昻??^ 0011111100111111001111111000100101000111100000101000110000111111001111110011111100111111100010001101100100111111001111110011111110000011100010001000000101000010100111000111110100111111001111111111101011010000001111110011111101011110 3f3f3f8947828c3f3f3f3f88d93f3f3f838881429c7d3f3ffad03f3f5e
EUC-JP ???烏l????異???ヨ。怡?????^ 00111111001111110011111110110001101010001010001111101100001111110011111100111111001111111011000011011011001111110011111100111111101001011110100010100001101000111101011111011110001111110011111100111111001111110011111101011110 3f3f3fb1a8a3ec3f3f3f3fb0db3f3f3fa5e8a1a3d7de3f3f3f3f3f5e
UTF-8 玲곷젷烏l츦隸욃떀異덄퓖溜ヨ。怡쒑닞昻뽰뎠^ 11101111101001101010110111101010101100111011011111101100101000001011011111100111100000111000111111101111101111011000110011101100101110001010011011101111101001101011100011101100100110101000001111101011100101101000000011100111100101011011000011101011100011011000010011101101100100111001011011101111101001111000101111100011100000111010100011100011100000001000001011100110100000001010000111101100100100101001000111101011100010111001111011100110100110001011101111101011101111011011000011101011100011101010000001011110 efa6adeab3b7eca0b7e7838fefbd8cecb8a6efa6b8ec9a83eb9680e795b0eb8d84ed9396efa78be383a8e38082e680a1ec9291eb8b9ee698bbebbdb0eb8ea05e
UHC 玲곷젷烏l츦隸욃떀異덄퓖溜ヨ。怡쒑닞昻뽰뎠^ 11100111101111111000000111101011101000001010101111101000101000011010001111101100101011101001110011100111111001101001111011100101100010111001011011101100101101101000100011100111101111111000000111101010111111101010101111101000101000011010001111101100101011101001110011101000100010001001111011100100111010011001011011101100101101011011000101011110 e7bf81eba0abe8a1a3ecae9ce7e69ee58b96ecb688e7bf81eafeabe8a1a3ecae9ce8889ee4e996ecb5b15e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
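
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The codec names (`cp932` for SJIS-Win, `euc_jp`, `cp949` for UHC) are assumptions about which Python codecs correspond most closely to the charsets listed, and the helper names `to_bit_strings` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the charset cannot encode.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset in CHARSETS."""
    return {name: to_bit_strings(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("あ^").items():
        print(name, binary, hexadecimal)
```

Each byte is formatted as eight binary digits, so the binary column is always four times as long as the hexadecimal column.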