To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??倚цぜ鷹??繹??誼??怨??鶯 11100001100111110011111100111111100110001101111110000100100010001000001010111010100100011110100100111111001111111110001110001000001111110011111110001011011000100011111100111111100010011000010100111111001111111110100111110010 e19f3f3f98df848882ba91e93f3fe3883f3f8b623f3f89853f3fe9f2
EUC-JP 癲??倚цぜ鷹??繹??誼??怨??鶯 11100010101000010011111100111111110100001110000110100111111010001010010010111100110000101110101100111111001111111110010111101000001111110011111110110101110000110011111100111111101100011110010100111111001111111111001011110100 e2a13f3fd0e1a7e8a4bcc2eb3f3fe5e83f3fb5c33f3fb1e53f3ff2f4
UTF-8 癲꾧퀗倚цぜ鷹됱춪繹먮굞誼댐쫫怨뚯맭鶯 1110011110011001101100101110101010111110101001111110110110000000100101111110010110000000100110101101000110000110111000111000000110011100111010011011011110111001111010111001000010110001111011001011011010101010111001111011100110111001111010111010100010101110111010101011010110011110111010001010101010111100111010111000110010010000111011001010101110101011111001101000000010101000111010111001101010101111111010111010011110101101111010011011011010101111 e799b2eabea7ed8097e5809ad186e3819ce9b7b9eb90b1ecb6aae7b9b9eba8aeeab59ee8aabceb8c90ecababe680a8eb9aafeba7ade9b6af
UHC 癲꾧퀗倚цぜ鷹됱춪繹먮굞誼댐쫫怨뚯맭鶯 1110111110100110100001001110101010110011100011001110101111101111101011001110100010101010101111001110101111101101100010011110110010101101100001111110011010111010100100001110101110000010100001101110101111111110101101001110111110100110100001001110101010110011100011001110110010010000101101001110010110100011 efa684eab38cebeface8aabcebed89ecad87e6ba90eb8286ebfeb4efa684eab38cec90b4e5a3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)