What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????BF 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100001001000110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4246
SJIS-WIN ツ渉ォテ湘ュツゥテ青渉「テεスBF 11000010100011111100001010101011110000111000111111000011101011011100001010101001110000111001000011000010100011111100001010100010110000111000001111000011101111010100001001000110 c28fc2abc38fc3adc2a9c390c28fc2a2c383c3bd4246
EUC-JP ツ渉ォテ湘ュツゥテ青渉「テεスBF 1000111011000010101111101100010010001110101010111000111011000011101111101100010110001110101011011000111011000010100011101010100110001110110000111100000011000100101111101100010010001110101000101000111011000011101001101100010110001110101111010100001001000110 8ec2bec48eab8ec3bec58ead8ec28ea98ec3c0c4bec48ea28ec3a6c58ebd4246
UTF-8 ツ渉ォテ湘ュツゥテ青渉「テεスBF 11101111101111101000001011100110101110001000100111101111101111011010101111101111101111101000001111100110101110011001100011101111101111011010110111101111101111101000001011101111101111011010100111101111101111101000001111101001100111011001001011100110101110001000100111101111101111011010001011101111101111101000001111001110101101011110111110111101101111010100001001000110 efbe82e6b889efbdabefbe83e6b998efbdadefbe82efbda9efbe83e99d92e6b889efbda2efbe83ceb5efbdbd4246
UHC ????湘????????ε?BF 00111111001111110011111100111111110111111100111100111111001111110011111100111111001111110011111100111111001111111010010111100101001111110100001001000110 3f3f3f3fdfcf3f3f3f3f3f3f3f3fa5e53f4246

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
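
The binary and hexadecimal columns can be approximated with a few lines of Python. This is only a sketch, not the converter's actual implementation: the function name `encode_report` is hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumed equivalents of the charset labels used above. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` seen in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the charset labels shown above to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text, codecs):
    """Return (codec, bit string, hex string) for text in each codec.

    Characters that cannot be encoded are replaced with "?" (0x3f).
    """
    rows = []
    for codec in codecs:
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((codec, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for codec, bits, hexa in encode_report("BF", CHARSETS.values()):
        print(codec, bits, hexa)
```

For the ASCII input `BF`, every charset listed yields the same two bytes, `4246`, which is why each row of the results above ends in `4246`.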