To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?^?th?^?tfN}?^?th?^?tfN{^ 00111111010111100011111101110100011010000011111101011110001111110111010001100110010011100111110100111111010111100011111101110100011010000011111101011110001111110111010001100110010011100111101101011110 3f5e3f74683f5e3f74664e7d3f5e3f74683f5e3f74664e7b5e
SJIS-WIN 奪^達th奪^達tfN}奪^達th奪^達tfN{^ 100100100100010001011110100100100100001001110100011010001001001001000100010111101001001001000010011101000110011001001110011111011001001001000100010111101001001001000010011101000110100010010010010001000101111010010010010000100111010001100110010011100111101101011110 92445e9242746892445e924274664e7d92445e9242746892445e924274664e7b5e
EUC-JP 奪^達th奪^達tfN}奪^達th奪^達tfN{^ 110000111010010101011110110000111010001101110100011010001100001110100101010111101100001110100011011101000110011001001110011111011100001110100101010111101100001110100011011101000110100011000011101001010101111011000011101000110111010001100110010011100111101101011110 c3a55ec3a37468c3a55ec3a374664e7dc3a55ec3a37468c3a55ec3a374664e7b5e
UTF-8 奪^達th奪^達tfN}奪^達th奪^達tfN{^ 1110010110100101101010100101111011101001100000011001010001110100011010001110010110100101101010100101111011101001100000011001010001110100011001100100111001111101111001011010010110101010010111101110100110000001100101000111010001101000111001011010010110101010010111101110100110000001100101000111010001100110010011100111101101011110 e5a5aa5ee981947468e5a5aa5ee9819474664e7de5a5aa5ee981947468e5a5aa5ee9819474664e7b5e
UHC 奪^達th奪^達tfN}奪^達th奪^達tfN{^ 111101111010110001011110110100111011100101110100011010001111011110101100010111101101001110111001011101000110011001001110011111011111011110101100010111101101001110111001011101000110100011110111101011000101111011010011101110010111010001100110010011100111101101011110 f7ac5ed3b97468f7ac5ed3b974664e7df7ac5ed3b97468f7ac5ed3b974664e7b5e

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
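
For readers who want to reproduce rows like those in the table offline, here is a minimal Python sketch. It is not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, as is the use of `errors="replace"` to mimic the `?` (0x3f) that the ISO-8859-1 row shows for characters it cannot represent. The helper name `encode_table` is hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" for unencodable characters,
        # which is an assumption about how the page handles them.
        data = text.encode(codec, errors="replace")
        # Render every byte as eight binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("奪^達th"):
        print(label, bits, hex_string)
```

For the input `奪^達th`, the UTF-8 row comes out as `e5a5aa5ee981947468`, which matches the first nine bytes of the UTF-8 row in the table.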