To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[}?????????[{^ 0011111100111111001111110011111100111111001111110011111100111111001111110101101101111101001111110011111100111111001111110011111100111111001111110011111100111111010110110111101101011110 3f3f3f3f3f3f3f3f3f5b7d3f3f3f3f3f3f3f3f3f5b7b5e
SJIS-WIN 悟??幼??乙j?[}悟??幼??乙j?[{^ 10001100111001010011111100111111100101110110001100111111001111111000100110110011100000101000101000111111010110110111110110001100111001010011111100111111100101110110001100111111001111111000100110110011100000101000101000111111010110110111101101011110 8ce53f3f97633f3f89b3828a3f5b7d8ce53f3f97633f3f89b3828a3f5b7b5e
EUC-JP 悟??幼??乙j?[}悟??幼??乙j?[{^ 10111000111001110011111100111111110011011100010000111111001111111011001010110101101000111110101000111111010110110111110110111000111001110011111100111111110011011100010000111111001111111011001010110101101000111110101000111111010110110111101101011110 b8e73f3fcdc43f3fb2b5a3ea3f5b7db8e73f3fcdc43f3fb2b5a3ea3f5b7b5e
UTF-8 悟귣뀞幼싦벧乙j큿[}悟귣뀞幼싦벧乙j큿[{^ 1110011010000010100111111110101010110111101000111110101110000000100111101110010110111001101111001110110010001011101001101110101110110010101001111110010010111001100110011110111110111101100010101110110110000001101111110101101101111101111001101000001010011111111010101011011110100011111010111000000010011110111001011011100110111100111011001000101110100110111010111011001010100111111001001011100110011001111011111011110110001010111011011000000110111111010110110111101101011110 e6829feab7a3eb809ee5b9bcec8ba6ebb2a7e4b999efbd8aed81bf5b7de6829feab7a3eb809ee5b9bcec8ba6ebb2a7e4b999efbd8aed81bf5b7b5e
UHC 悟귣뀞幼싦벧乙j큿[}悟귣뀞幼싦벧乙j큿[{^ 1110011111110110100000101110101110000101100101011110101011101010100110101110010010111010101001101110101111100000101000111110101010110100100011000101101101111101111001111111011010000010111010111000010110010101111010101110101010011010111001001011101010100110111010111110000010100011111010101011010010001100010110110111101101011110 e7f682eb8595eaea9ae4baa6ebe0a3eab48c5b7de7f682eb8595eaea9ae4baa6ebe0a3eab48c5b7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)