To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????[????????[^ 00111111001111110011111100111111001111110011111100111111001111110101101100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 狄礪狄炙狄礪狄炙[狄礪狄炙狄礪狄炙[^ 1110000010111101111000011110100011100000101111011110000001110100111000001011110111100001111010001110000010111101111000000111010001011011111000001011110111100001111010001110000010111101111000000111010011100000101111011110000111101000111000001011110111100000011101000101101101011110 e0bde1e8e0bde074e0bde1e8e0bde0745be0bde1e8e0bde074e0bde1e8e0bde0745b5e
EUC-JP 狄礪狄炙狄礪狄炙[狄礪狄炙狄礪狄炙[^ 1110000010111111111000101110101011100000101111111101111111010101111000001011111111100010111010101110000010111111110111111101010101011011111000001011111111100010111010101110000010111111110111111101010111100000101111111110001011101010111000001011111111011111110101010101101101011110 e0bfe2eae0bfdfd5e0bfe2eae0bfdfd55be0bfe2eae0bfdfd5e0bfe2eae0bfdfd55b5e
UTF-8 狄礪狄炙狄礪狄炙[狄礪狄炙狄礪狄炙[^ 111001111000101110000100111001111010010010101010111001111000101110000100111001111000001010011001111001111000101110000100111001111010010010101010111001111000101110000100111001111000001010011001010110111110011110001011100001001110011110100100101010101110011110001011100001001110011110000010100110011110011110001011100001001110011110100100101010101110011110001011100001001110011110000010100110010101101101011110 e78b84e7a4aae78b84e78299e78b84e7a4aae78b84e782995be78b84e7a4aae78b84e78299e78b84e7a4aae78b84e782995b5e
UHC 狄礪狄炙狄礪狄炙[狄礪狄炙狄礪狄炙[^ 1110111011011010110101011110110011101110110110101110110110110011111011101101101011010101111011001110111011011010111011011011001101011011111011101101101011010101111011001110111011011010111011011011001111101110110110101101010111101100111011101101101011101101101100110101101101011110 eedad5eceedaedb3eedad5eceedaedb35beedad5eceedaedb3eedad5eceedaedb35b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)