To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???T???M???T???M 00111111001111110011111101010100001111110011111100111111010011010011111100111111001111110101010000111111001111110011111101001101 3f3f3f543f3f3f4d3f3f3f543f3f3f4d
SJIS-WIN 怐唏鶺T怐唏鶺M怐唏鶺T怐唏鶺M 10011100100000011001101001001000111010100101010001010100100111001000000110011010010010001110101001010100010011011001110010000001100110100100100011101010010101000101010010011100100000011001101001001000111010100101010001001101 9c819a48ea54549c819a48ea544d9c819a48ea54549c819a48ea544d
EUC-JP 怐唏鶺T怐唏鶺M怐唏鶺T怐唏鶺M 11010111111000011101001110101001111100111011010101010100110101111110000111010011101010011111001110110101010011011101011111100001110100111010100111110011101101010101010011010111111000011101001110101001111100111011010101001101 d7e1d3a9f3b554d7e1d3a9f3b54dd7e1d3a9f3b554d7e1d3a9f3b54d
UTF-8 怐唏鶺T怐唏鶺M怐唏鶺T怐唏鶺M 11100110100000001001000011100101100101001000111111101001101101101011101001010100111001101000000010010000111001011001010010001111111010011011011010111010010011011110011010000000100100001110010110010100100011111110100110110110101110100101010011100110100000001001000011100101100101001000111111101001101101101011101001001101 e68090e5948fe9b6ba54e68090e5948fe9b6ba4de68090e5948fe9b6ba54e68090e5948fe9b6ba4d
UHC ???T???M???T???M 00111111001111110011111101010100001111110011111100111111010011010011111100111111001111110101010000111111001111110011111101001101 3f3f3f543f3f3f4d3f3f3f543f3f3f4d

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)