What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ?????????\}?????????\{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101110001111101001111110011111100111111001111110011111100111111001111110011111100111111010111000111101101011110 3f3f3f3f3f3f3f3f3f5c7d3f3f3f3f3f3f3f3f3f5c7b5e
SJIS-WIN 鼇??猶??猷??\}鼇??猶??猷??\{^ 1110101010000111001111110011111110010111010100000011111100111111100101110101000100111111001111110101110001111101111010101000011100111111001111111001011101010000001111110011111110010111010100010011111100111111010111000111101101011110 ea873f3f97503f3f97513f3f5c7dea873f3f97503f3f97513f3f5c7b5e
EUC-JP 鼇??猶??猷??\}鼇??猶??猷??\{^ 1111001111100111001111110011111111001101101100010011111100111111110011011011001000111111001111110101110001111101111100111110011100111111001111111100110110110001001111110011111111001101101100100011111100111111010111000111101101011110 f3e73f3fcdb13f3fcdb23f3f5c7df3e73f3fcdb13f3fcdb23f3f5c7b5e
UTF-8 鼇딅뜇猶뚦볕猷믩쥖\}鼇딅뜇猶뚦볕猷믩쥖\{^ 1110100110111100100001111110101110010100100001011110101110011100100001111110011110001100101101101110101110011010101001101110101110110011100101011110011110001100101101111110101110101111101010011110110010100101100101100101110001111101111010011011110010000111111010111001010010000101111010111001110010000111111001111000110010110110111010111001101010100110111010111011001110010101111001111000110010110111111010111010111110101001111011001010010110010110010111000111101101011110 e9bc87eb9485eb9c87e78cb6eb9aa6ebb395e78cb7ebafa9eca5965c7de9bc87eb9485eb9c87e78cb6eb9aa6ebb395e78cb7ebafa9eca5965c7b5e
UHC 鼇딅뜇猶뚦볕猷믩쥖\}鼇딅뜇猶뚦볕猷믩쥖\{^ 1110100010101000100010101110101110001101100010101110101110100010100011001110010110111010101101011110101110100011100100101110101110100010100011000101110001111101111010001010100010001010111010111000110110001010111010111010001010001100111001011011101010110101111010111010001110010010111010111010001010001100010111000111101101011110 e8a88aeb8d8aeba28ce5bab5eba392eba28c5c7de8a88aeb8d8aeba28ce5bab5eba392eba28c5c7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
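
The same kind of table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the charset labels above to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption, and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with `?` (0x3f), which resembles the `3f` bytes seen in the rows above.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption, not taken from the page itself.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    data = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("あ").items():
        print(label, bits, hex_string)
```

For instance, `encode_row("A", "utf-8")` returns `("01000001", "41")`, since ASCII characters occupy one byte in all of the charsets listed here.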