To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???}Y???}bE 0011111100111111001111110111110101011001001111110011111100111111011111010110001001000101 3f3f3f7d593f3f3f7d6245
SJIS-WIN テつ殉}Yテつ殉}bE 110000111000001011000010100011110111110101111101010110011100001110000010110000101000111101111101011111010110001001000101 c382c28f7d7d59c382c28f7d7d6245
EUC-JP テつ殉}Yテつ殉}bE 1000111011000011101001001100010010111101110111100111110101011001100011101100001110100100110001001011110111011110011111010110001001000101 8ec3a4c4bdde7d598ec3a4c4bdde7d6245
UTF-8 テつ殉}Yテつ殉}bE 1110111110111110100000111110001110000001101001001110011010101110100010010111110101011001111011111011111010000011111000111000000110100100111001101010111010001001011111010110001001000101 efbe83e381a4e6ae897d59efbe83e381a4e6ae897d6245
UHC ?つ殉}Y?つ殉}bE 001111111010101011000100111000101110011001111101010110010011111110101010110001001110001011100110011111010110001001000101 3faac4e2e67d593faac4e2e67d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)