What bit string is a character (or a short run of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?T??}?T??{^ 0011111101010100001111110011111101111101001111110101010000111111001111110111101101011110 3f543f3f7d3f543f3f7b5e
SJIS-WIN 短T孫束}短T孫束{^ 1001001001011010010101001001000110110111100100011010100101111101100100100101101001010100100100011011011110010001101010010111101101011110 925a5491b791a97d925a5491b791a97b5e
EUC-JP 短T孫束}短T孫束{^ 1100001110111011010101001100001010111001110000101010101101111101110000111011101101010100110000101011100111000010101010110111101101011110 c3bb54c2b9c2ab7dc3bb54c2b9c2ab7b5e
UTF-8 短T孫束}短T孫束{^ 1110011110011111101011010101010011100101101011011010101111100110100111011001111101111101111001111001111110101101010101001110010110101101101010111110011010011101100111110111101101011110 e79fad54e5adabe69d9f7de79fad54e5adabe69d9f7b5e
UHC 短T孫束}短T孫束{^ 1101001110101101010101001110000111011101111000011101011001111101110100111010110101010100111000011101110111100001110101100111101101011110 d3ad54e1dde1d67dd3ad54e1dde1d67b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
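
The rows of the table can be reproduced offline with a short script. The following is a minimal sketch in Python, not the converter's own implementation: the helper name `encode_row` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters a charset cannot represent are replaced with `?` (byte `3f`), which appears to be what the ISO-8859-1 row shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (round-tripped text, binary bit string, hex string) for one charset.

    Hypothetical helper: unencodable characters become "?" via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    shown = raw.decode(codec)                    # what survives the round trip
    bits = "".join(f"{b:08b}" for b in raw)      # 8 bits per encoded byte
    return shown, bits, raw.hex()


if __name__ == "__main__":
    sample = "短T孫束}短T孫束{^"
    for label, codec in CHARSETS.items():
        _, bits, hex_string = encode_row(sample, codec)
        # Only ASCII is printed so the loop works on any console encoding.
        print(label, bits, hex_string)
```

Each byte is rendered as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.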