To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???伊?????鴨??率③?臾??????? 00111111001111110011111110001000110010010011111100111111001111110011111100111111100010101001101100111111001111111001011110100110100001110100001000111111111001000110101100111111001111110011111100111111001111110011111100111111 3f3f3f88c93f3f3f3f3f8a9b3f3f97a687423fe46b3f3f3f3f3f3f3f
EUC-JP ???伊?????鴨??率??臾??????? 001111110011111100111111101100001100101100111111001111110011111100111111001111111011001111111011001111110011111111001110101010000011111100111111111001111100110000111111001111110011111100111111001111110011111100111111 3f3f3fb0cb3f3f3f3f3fb3fb3f3fcea83f3fe7cc3f3f3f3f3f3f3f
UTF-8 閱묐챷伊쒒렟類잙룆鴨앷퉵率③넫臾볥츊麗몃씈柳켇 111010011001011010110001111010111010110010010000111011001011000110110111111001001011110010001010111011001001001010010010111010111010000010011111111011111010011110010000111011001001111010011001111010111010001110000110111010011011010010101000111011001001010110110111111011011000100110110101111001111000111010000111111000101001000110100010111010111000010010101011111010001000011110111110111010111011001110100101111011001011100010001010111011111010011010001000111010111010101010000011111011001001010010001000111011111010011110001001111011001011110010000111 e996b1ebac90ecb1b7e4bc8aec9292eba09fefa790ec9e99eba386e9b4a8ec95b7ed89b5e78e87e291a2eb84abe887beebb3a5ecb88aefa688ebaa83ec9488efa789ecbc87
UHC 閱묐챷伊쒒렟類잙룆鴨앷퉵率③넫臾볥츊麗몃씈柳켇 11100110111100111001000111101011101010101000010011101100101001011001110011101001100011101011000011101011101110101001111111101011100011111000010111100100111001011001110111101010101110011000110111100001111000111010100011101001100001101010101111101011101011001001001111101011101011101000011011100110101100001011100011101011100111011010000011101010111101111011000101000101 e6f391ebaa84eca59ce98eb0ebba9feb8f85e4e59deab98de1e3a8e986abebac93ebae86e6b0b8eb9da0eaf7b145

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)