What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????N}?????????N{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100111001111101001111110011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN ?l????鷹??N}?l????鷹??N{^ 001111111000001010001100001111110011111100111111001111111001000111101001001111110011111101001110011111010011111110000010100011000011111100111111001111110011111110010001111010010011111100111111010011100111101101011110 3f828c3f3f3f3f91e93f3f4e7d3f828c3f3f3f3f91e93f3f4e7b5e
EUC-JP 渶l?佾??鷹??N}渶l?佾??鷹??N{^ 1000111111000111111011011010001111101100001111111000111110110000111110110011111100111111110000101110101100111111001111110100111001111101100011111100011111101101101000111110110000111111100011111011000011111011001111110011111111000010111010110011111100111111010011100111101101011110 8fc7eda3ec3f8fb0fb3f3fc2eb3f3f4e7d8fc7eda3ec3f8fb0fb3f3fc2eb3f3f4e7b5e
UTF-8 渶l룆佾㎫랜鷹낇뜢N}渶l룆佾㎫랜鷹낇뜢N{^ 1110011010111000101101101110111110111101100011001110101110100011100001101110010010111101101111101110001110001110101010111110101110011110100111001110100110110111101110011110101110000010100001111110101110011100101000100100111001111101111001101011100010110110111011111011110110001100111010111010001110000110111001001011110110111110111000111000111010101011111010111001111010011100111010011011011110111001111010111000001010000111111010111001110010100010010011100111101101011110 e6b8b6efbd8ceba386e4bdbee38eabeb9e9ce9b7b9eb8287eb9ca24e7de6b8b6efbd8ceba386e4bdbee38eabeb9e9ce9b7b9eb8287eb9ca24e7b5e
UHC 渶l룆佾㎫랜鷹낇뜢N}渶l룆佾㎫랜鷹낇뜢N{^ 1110011110110111101000111110110010001111100001011110110011101011101001111110011110110111101000111110101111101101100001011110110110001101101001010100111001111101111001111011011110100011111011001000111110000101111011001110101110100111111001111011011110100011111010111110110110000101111011011000110110100101010011100111101101011110 e7b7a3ec8f85eceba7e7b7a3ebed85ed8da54e7de7b7a3ec8f85eceba7e7b7a3ebed85ed8da54e7b5e
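The rows above can be approximated offline with a short script. This is a minimal sketch, not the page's actual implementation: the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the names CHARSETS, to_bit_strings and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the 3f runs in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string).

    Unencodable characters are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def encode_table(text):
    """Return {charset label: (binary, hex)} for every charset above."""
    return {label: to_bit_strings(text, codec)
            for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # "N{^" is pure ASCII, so every charset listed yields hex 4e7b5e.
    for label, (binary, hexa) in encode_table("N{^").items():
        print(label, binary, hexa)
```

Calling encode_table with a non-ASCII string shows where the charsets diverge: the same character may take one, two or three bytes, or fall back to 3f, depending on the charset.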

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).