What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????T?????????? 0011111100111111001111110011111100111111001111110101010000111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f543f3f3f3f3f3f3f3f3f3f
SJIS-WIN 絶??章??T絶??章??絶??壯 1001000011100010001111110011111110001111110011010011111100111111010101001001000011100010001111110011111110001111110011010011111100111111100100001110001000111111001111111001101011100001 90e23f3f8fcd3f3f5490e23f3f8fcd3f3f90e23f3f9ae1
EUC-JP 絶??章??T絶??章??絶??壯 1100000011100100001111110011111110111110110011110011111100111111010101001100000011100100001111110011111110111110110011110011111100111111110000001110010000111111001111111101010011100011 c0e43f3fbecf3f3f54c0e43f3fbecf3f3fc0e43f3fd4e3
UTF-8 絶랃풘章쏈벘T絶랃풘章쏈벘絶랃풘壯 11100111101101011011011011101011100111101000001111101101100100101001100011100111101010111010000011101100100011111000100011101011101100101001100001010100111001111011010110110110111010111001111010000011111011011001001010011000111001111010101110100000111011001000111110001000111010111011001010011000111001111011010110110110111010111001111010000011111011011001001010011000111001011010001110101111 e7b5b6eb9e83ed9298e7aba0ec8f88ebb29854e7b5b6eb9e83ed9298e7aba0ec8f88ebb298e7b5b6eb9e83ed9298e5a3af
UHC 絶랃풘章쏈벘T絶랃풘章쏈벘絶랃풘壯 111011111011111010001101111011111011111010011011111011011111000110011011111011101001001110110101010101001110111110111110100011011110111110111110100110111110110111110001100110111110111010010011101101011110111110111110100011011110111110111110100110111110110111100000 efbe8defbe9bedf19bee93b554efbe8defbe9bedf19bee93b5efbe8defbe9bede0

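SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is encoded as "?" (0x3f), which is why the ISO-8859-1 row consists almost entirely of 3f bytes.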
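
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the `CODECS` mapping from the table's labels to Python codec names is an assumption (in particular, Python's `cp932` and `cp949` are taken as stand-ins for SJIS-WIN and UHC), and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), matching the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# cp932 and cp949 are used as stand-ins for SJIS-WIN and UHC (an assumption).
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # errors="replace" substitutes "?" (0x3f) for each unencodable character.
    raw = text.encode(codec, errors="replace")
    # Decode again to show what actually survived the round trip.
    shown = raw.decode(codec, errors="replace")
    # Eight bits per byte, zero-padded, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


def encode_table(text):
    """Build one row per charset, keyed by the label used in the table."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (shown, bits, hexa) in encode_table("絶T").items():
        print(label, shown, bits, hexa)
```

Because the codec tables in Python may differ in edge cases from the ones this page uses, results for unusual characters are not guaranteed to match the table byte for byte.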