To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???W^???\}v???W^???\}vB 0011111100111111001111110101011101011110001111110011111100111111010111000111110101110110001111110011111100111111010101110101111000111111001111110011111101011100011111010111011001000010 3f3f3f575e3f3f3f5c7d763f3f3f575e3f3f3f5c7d7642
SJIS-WIN 衙?刎W^衙?刎\}v衙?刎W^衙?刎\}vB 11100101110010010011111110011001100001100101011101011110111001011100100100111111100110011000011001011100011111010111011011100101110010010011111110011001100001100101011101011110111001011100100100111111100110011000011001011100011111010111011001000010 e5c93f9986575ee5c93f99865c7d76e5c93f9986575ee5c93f99865c7d7642
EUC-JP 衙?刎W^衙?刎\}v衙?刎W^衙?刎\}vB 11101010110010110011111111010001111001100101011101011110111010101100101100111111110100011110011001011100011111010111011011101010110010110011111111010001111001100101011101011110111010101100101100111111110100011110011001011100011111010111011001000010 eacb3fd1e6575eeacb3fd1e65c7d76eacb3fd1e6575eeacb3fd1e65c7d7642
UTF-8 衙뤾刎W^衙뤾刎\}v衙뤾刎W^衙뤾刎\}vB 1110100010100001100110011110101110100100101111101110010110001000100011100101011101011110111010001010000110011001111010111010010010111110111001011000100010001110010111000111110101110110111010001010000110011001111010111010010010111110111001011000100010001110010101110101111011101000101000011001100111101011101001001011111011100101100010001000111001011100011111010111011001000010 e8a199eba4bee5888e575ee8a199eba4bee5888e5c7d76e8a199eba4bee5888e575ee8a199eba4bee5888e5c7d7642
UHC 衙뤾刎W^衙뤾刎\}v衙뤾刎W^衙뤾刎\}vB 1110010010110111100011111110101011011001111110110101011101011110111001001011011110001111111010101101100111111011010111000111110101110110111001001011011110001111111010101101100111111011010101110101111011100100101101111000111111101010110110011111101101011100011111010111011001000010 e4b78fead9fb575ee4b78fead9fb5c7d76e4b78fead9fb575ee4b78fead9fb5c7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)