What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????????TB 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101010001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5442
SJIS-WIN テッツァツ「テャツ按暗」ツ篠湘・ツァツィテォツ崢揺TB 110000111010111111000010101001111100001010100010110000111010110011000010100010001100001010001000110000111010001111000010100011101100001010001111110000111010010111000010101001111100001010101000110000111010101111000010100110111100001010010111011010000101010001000010 c3afc2a7c2a2c3acc288c288c3a3c28ec28fc3a5c2a7c2a8c3abc29bc297685442
EUC-JP テッツァツ「テャツ按暗」ツ篠湘・ツァツィテォツ崢揺TB 10001110110000111000111010101111100011101100001010001110101001111000111011000010100011101010001010001110110000111000111010101100100011101100001010110000110001001011000011000101100011101010001110001110110000101011110011000100101111101100010110001110101001011000111011000010100011101010011110001110110000101000111010101000100011101100001110001110101010111000111011000010110101101100010011001101110010010101010001000010 8ec38eaf8ec28ea78ec28ea28ec38eac8ec2b0c4b0c58ea38ec2bcc4bec58ea58ec28ea78ec28ea88ec38eab8ec2d6c4cdc95442
UTF-8 テッツァツ「テャツ按暗」ツ篠湘・ツァツィテォツ崢揺TB 1110111110111110100000111110111110111101101011111110111110111110100000101110111110111101101001111110111110111110100000101110111110111101101000101110111110111110100000111110111110111101101011001110111110111110100000101110011010001100100010011110011010011010100101111110111110111101101000111110111110111110100000101110011110101111101000001110011010111001100110001110111110111101101001011110111110111110100000101110111110111101101001111110111110111110100000101110111110111101101010001110111110111110100000111110111110111101101010111110111110111110100000101110010110110100101000101110011010001111101110100101010001000010 efbe83efbdafefbe82efbda7efbe82efbda2efbe83efbdacefbe82e68c89e69a97efbda3efbe82e7afa0e6b998efbda5efbe82efbda7efbe82efbda8efbe83efbdabefbe82e5b4a2e68fba5442
UHC ?????????按暗??篠湘??????????TB 00111111001111110011111100111111001111110011111100111111001111110011111111100100110011101110010011011110001111110011111111100001110001101101111111001111001111110011111100111111001111110011111100111111001111110011111100111111001111110101010001000010 3f3f3f3f3f3f3f3f3fe4cee4de3f3fe1c6dfcf3f3f3f3f3f3f3f3f3f3f5442

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
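
The same kind of table can be reproduced offline. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_table` and the mapping of charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (hex `3f`), which appears to be what the table above shows for ISO-8859-1 and UHC.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 is Python's closest match
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes '?' for unencodable characters
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("TB"):
        print(label, bits, hex_string)
```

For pure ASCII input such as `TB`, every charset listed here yields the same bytes (`5442`), which is why each row in the table above ends with that value.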