To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 鴉??蘂?????}鴉??蘂?????{^ 11101001111010110011111100111111111001010100000100111111001111110011111100111111001111110111110111101001111010110011111100111111111001010100000100111111001111110011111100111111001111110111101101011110 e9eb3f3fe5413f3f3f3f3f7de9eb3f3fe5413f3f3f3f3f7b5e
EUC-JP 鴉??蘂?????}鴉??蘂?????{^ 11110010111011010011111100111111111010011010001000111111001111110011111100111111001111110111110111110010111011010011111100111111111010011010001000111111001111110011111100111111001111110111101101011110 f2ed3f3fe9a23f3f3f3f3f7df2ed3f3fe9a23f3f3f3f3f7b5e
UTF-8 鴉딅젷蘂섆펯溜띾젶}鴉딅젷蘂섆펯溜띾젶{^ 111010011011010010001001111010111001010010000101111011001010000010110111111010001001100010000010111011001000010010000110111011011000111010101111111011111010011110001011111010111001110110111110111011001010000010110110011111011110100110110100100010011110101110010100100001011110110010100000101101111110100010011000100000101110110010000100100001101110110110001110101011111110111110100111100010111110101110011101101111101110110010100000101101100111101101011110 e9b489eb9485eca0b7e89882ec8486ed8eafefa78beb9dbeeca0b67de9b489eb9485eca0b7e89882ec8486ed8eafefa78beb9dbeeca0b67b5e
UHC 鴉딅젷蘂섆펯溜띾젶}鴉딅젷蘂섆펯溜띾젶{^ 111001001011110010001010111010111010000010101011111001111101111010011000111001001011110010000001111010101111111010001101111010111010000010101010011111011110010010111100100010101110101110100000101010111110011111011110100110001110010010111100100000011110101011111110100011011110101110100000101010100111101101011110 e4bc8aeba0abe7de98e4bc81eafe8deba0aa7de4bc8aeba0abe7de98e4bc81eafe8deba0aa7b5e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
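
The conversion shown in the table can be approximated outside this page. The following is a minimal sketch in Python, not the tool's own implementation. The helper names (encode_row, encode_table) are hypothetical, and the mapping from the table's charset labels to Python codec names is an assumption: cp932 stands in for SJIS-Win and cp949 for UHC. Python's codecs may not agree with the tool in every edge case. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches the 3f runs in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # One row per charset: label, binary bit string, hexadecimal bit string.
    for label, (binary, hexadecimal) in encode_table("鴉").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.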