To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????E 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000101 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f45
SJIS-WIN 汚?????阿??節?汚?????阿??節?E 1000100110011000001111110011111100111111001111110011111110001000101000100011111100111111100100001101111100111111100010011001100000111111001111110011111100111111001111111000100010100010001111110011111110010000110111110011111101000101 89983f3f3f3f3f88a23f3f90df3f89983f3f3f3f3f88a23f3f90df3f45
EUC-JP 汚?????阿??節?汚?????阿??節?E 1011000111111000001111110011111100111111001111110011111110110000101001000011111100111111110000001110000100111111101100011111100000111111001111110011111100111111001111111011000010100100001111110011111111000000111000010011111101000101 b1f83f3f3f3f3fb0a43f3fc0e13fb1f83f3f3f3f3fb0a43f3fc0e13f45
UTF-8 汚뗥ㄼ呂양괵阿쀯숱節쮇汚뗥ㄼ呂양괵阿쀯숱節쮊E 11100110101100011001101011101011100101111010010111100011100001001011110011101111101001101000000011101100100101101001000111101010101101001011010111101001100110001011111111101100100000001010111111101100100010001011000111100111101011111000000011101100101011101000011111100110101100011001101011101011100101111010010111100011100001001011110011101111101001101000000011101100100101101001000111101010101101001011010111101001100110001011111111101100100000001010111111101100100010001011000111100111101011111000000011101100101011101000101001000101 e6b19aeb97a5e384bcefa680ec9691eab4b5e998bfec80afec88b1e7af80ecae87e6b19aeb97a5e384bcefa680ec9691eab4b5e998bfec80afec88b1e7af80ecae8a45
UHC 汚뗥ㄼ呂양괵阿쀯숱節쮇汚뗥ㄼ呂양괵阿쀯숱節쮊E 111001111111110110001011111001011010010010101100111001011111101110111110111001111011000110101100111001001011100110010111111011111011110110100010111011111011110110101000010110011110011111111101100010111110010110100100101011001110010111111011101111101110011110110001101011001110010010111001100101111110111110111101101000101110111110111101101010000110001001000101 e7fd8be5a4ace5fbbee7b1ace4b997efbda2efbda859e7fd8be5a4ace5fbbee7b1ace4b997efbda2efbda86245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)