Which bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鶯??茵??惟る?筌?????淫??筌 1110100111110010001111110011111111100100100111110011111100111111100010001101001010000010111010010011111111100010101000110011111100111111001111110011111100111111100010001111101000111111001111111110001010100011 e9f23f3fe49f3f3f88d282e93fe2a33f3f3f3f3f88fa3f3fe2a3
EUC-JP 鶯??茵??惟る?筌??馹??淫??筌 11110010111101000011111100111111111010001010000100111111001111111011000011010100101001001110101100111111111001001010010100111111001111111000111111101001101000010011111100111111101100001111110000111111001111111110010010100101 f2f43f3fe8a13f3fb0d4a4eb3fe4a53f3f8fe9a13f3fb0fc3f3fe4a5
UTF-8 鶯끹끁茵뗥맅惟る짃筌뗢넂馹븀랜淫됰돇筌 111010011011011010101111111010111000000110111001111010111000000110000001111010001000110010110101111010111001011110100101111010111010011110000101111001101000001110011111111000111000001010001011111011001010011110000011111001111010110110001100111010111001011110100010111010111000010010000010111010011010011010111001111010111011100010000000111010111001111010011100111001101011011110101011111010111001000010110000111010111000111110000111111001111010110110001100 e9b6afeb81b9eb8181e88cb5eb97a5eba785e6839fe3828beca783e7ad8ceb97a2eb8482e9a6b9ebb880eb9e9ce6b7abeb90b0eb8f87e7ad8c
UHC 鶯끹끁茵뗥맅惟る짃筌뗢넂馹븀랜淫됰돇筌 1110010110100011100001011110001110000101101101111110110011100000100010111110010110010000100111111110101011101110101010101110101110100011100100111110111110100111100010111110001010000110100100101110110011110001101110101110011110110111101000111110101111100010100010011110101110001001100110001110111110100111 e5a385e385b7ece08be5909feaeeaaeba393efa78be28692ecf1bae7b7a3ebe289eb8998efa7

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
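
The same kind of conversion can be sketched in Python. This is only an illustration, not the code behind this page: the function name encode_all and the mapping from the charset labels above to Python codec names (for example SJIS-WIN to cp932, UHC to cp949) are assumptions, and Python's codecs may differ from this page's converter for some characters. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the 3f bytes seen in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_all(text):
    """Return {label: (binary bit string, hex string)} for each charset."""
    rows = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_all("A").items():
        print(label, bits, hex_string)
```

For a plain ASCII letter such as "A", every charset listed here yields the same single byte (hex 41); the rows only diverge for characters outside ASCII.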