To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????T?????????? 0011111100111111001111110011111100111111001111110101010000111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f543f3f3f3f3f3f3f3f3f3f
SJIS-WIN 絶??章??T絶??章??絶??? 10010000111000100011111100111111100011111100110100111111001111110101010010010000111000100011111100111111100011111100110100111111001111111001000011100010001111110011111100111111 90e23f3f8fcd3f3f5490e23f3f8fcd3f3f90e23f3f3f
EUC-JP 絶??章??T絶??章??絶??琰 110000001110010000111111001111111011111011001111001111110011111101010100110000001110010000111111001111111011111011001111001111110011111111000000111001000011111100111111100011111100110010110100 c0e43f3fbecf3f3f54c0e43f3fbecf3f3fc0e43f3f8fccb4
UTF-8 絶랃풘章쏈뒯T絶랃풘章쏈뱵絶랃풘琰 11100111101101011011011011101011100111101000001111101101100100101001100011100111101010111010000011101100100011111000100011101011100100101010111101010100111001111011010110110110111010111001111010000011111011011001001010011000111001111010101110100000111011001000111110001000111010111011000110110101111001111011010110110110111010111001111010000011111011011001001010011000111001111001000010110000 e7b5b6eb9e83ed9298e7aba0ec8f88eb92af54e7b5b6eb9e83ed9298e7aba0ec8f88ebb1b5e7b5b6eb9e83ed9298e790b0
UHC 絶랃풘章쏈뒯T絶랃풘章쏈뱵絶랃풘琰 111011111011111010001101111011111011111010011011111011011111000110011011111011101000101010101000010101001110111110111110100011011110111110111110100110111110110111110001100110111110111010010011100110111110111110111110100011011110111110111110100110111110011011111100 efbe8defbe9bedf19bee8aa854efbe8defbe9bedf19bee939befbe8defbe9be6fc

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)