To which bit string is a character (or short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character(s) Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 鴉?????沃??}鴉?????沃??{^ 11101001111010110011111100111111001111110011111100111111100101111000000000111111001111110111110111101001111010110011111100111111001111110011111100111111100101111000000000111111001111110111101101011110 e9eb3f3f3f3f3f97803f3f7de9eb3f3f3f3f3f97803f3f7b5e
EUC-JP 鴉?????沃??}鴉?????沃??{^ 11110010111011010011111100111111001111110011111100111111110011011110000000111111001111110111110111110010111011010011111100111111001111110011111100111111110011011110000000111111001111110111101101011110 f2ed3f3f3f3f3fcde03f3f7df2ed3f3f3f3f3fcde03f3f7b5e
UTF-8 鴉딅젷娛붾젳沃뚮젽}鴉딅젷娛붾젳沃뚮젽{^ 111010011011010010001001111010111001010010000101111011001010000010110111111001011010100010011011111010111011011010111110111011001010000010110011111001101011001010000011111010111001101010101110111011001010000010111101011111011110100110110100100010011110101110010100100001011110110010100000101101111110010110101000100110111110101110110110101111101110110010100000101100111110011010110010100000111110101110011010101011101110110010100000101111010111101101011110 e9b489eb9485eca0b7e5a89bebb6beeca0b3e6b283eb9aaeeca0bd7de9b489eb9485eca0b7e5a89bebb6beeca0b3e6b283eb9aaeeca0bd7b5e
UHC 鴉딅젷娛붾젳沃뚮젽}鴉딅젷娛붾젳沃뚮젽{^ 111001001011110010001010111010111010000010101011111001111111010010010100111010111010000010100111111010001010101010001100111010111010000010101111011111011110010010111100100010101110101110100000101010111110011111110100100101001110101110100000101001111110100010101010100011001110101110100000101011110111101101011110 e4bc8aeba0abe7f494eba0a7e8aa8ceba0af7de4bc8aeba0abe7f494eba0a7e8aa8ceba0af7b5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
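
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the helper name `encode_table` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte 0x3f), which matches the `3f` runs visible in the table.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
# The pairing (e.g. SJIS-WIN -> cp932, UHC -> cp949) is an assumption.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, concatenated.
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


# Print one line per charset for a sample ASCII character.
for label, binary, hexstr in encode_table("A"):
    print(label, binary, hexstr)
```

ASCII characters such as `A` or `}` encode to the same single byte in all five charsets, so differences only appear for non-ASCII input.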