Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鸚??意??議??押り?愉↑????? 1110101001011111001111110011111110001000110100110011111100111111100010110110001100111111001111111000100110011111100000101110100000111111100101101111100110000001101010100011111100111111001111110011111100111111 ea5f3f3f88d33f3f8b633f3f899f82e83f96f981aa3f3f3f3f3f
EUC-JP 鸚??意??議??押り?愉↑????孼 11110011110000000011111100111111101100001101010100111111001111111011010111000100001111110011111110110010101000011010010011101010001111111100110011111011101000101010110000111111001111110011111100111111100011111011101011000011 f3c03f3fb0d53f3fb5c43f3fb2a1a4ea3fccfba2ac3f3f3f3f8fbac3
UTF-8 鸚쒓퍓意쎿룚議쏀뱺押り램愉↑삏戮곕닰孼 111010011011100010011010111011001001001010010011111011011000110110010011111001101000010010001111111011001000111010111111111010111010001110011010111010001010110110110000111011001000111110000000111010111011000110111010111001101000101010111100111000111000001010001010111010111001111010101000111001101000010010001001111000101000011010010001111011001000001010001111111011111010011110010010111010101011001110010101111010111000101110110000111001011010110110111100 e9b89aec9293ed8d93e6848fec8ebfeba39ae8adb0ec8f80ebb1bae68abce3828aeb9ea8e68489e28691ec828fefa792eab395eb8bb0e5adbc
UHC 鸚쒓퍓意쎿룚議쏀뱺押り램愉↑삏戮곕닰孼 1110010110100100100111001110101010111011100010101110101111110010100110111110011010001111100101101110110010100001101111011110110110010011101000001110010011100011101010101110101010110111101001011110101011110000101000011110100010011000100101101110101110111101101100001110101110001000101001101110010111101101 e5a49ceabb8aebf29be68f96eca1bded93a0e4e3aaeab7a5eaf0a1e89896ebbdb0eb88a6e5ed

SJIS-WIN, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP) respectively.
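
A conversion like the one in the table can be approximated offline. The following is a minimal sketch, not the page's actual implementation. The function name `encode_table` and the mapping from the page's charset labels to Python codec names are assumptions (in particular, UHC is assumed to correspond to Python's `cp949` and SJIS-WIN to `cp932`). Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` visible in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset.

    Hypothetical helper: unencodable characters become '?' (0x3f).
    """
    rows = []
    for label, codec in CHARSETS.items():
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((label, bits, data.hex()))
    return rows


# "\u3042" is HIRAGANA LETTER A; written as an escape to keep the source ASCII.
for label, bits, hex_string in encode_table("\u3042"):
    print(label, bits, hex_string)
```

For this input, ISO-8859-1 yields `3f` because the character has no Latin-1 representation, while UTF-8 yields the three bytes `e38182`.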