To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN ?諷呈??諷呈?}v?諷呈??諷呈?}vB 0011111111100110100001011001001011100110001111110011111111100110100001011001001011100110001111110111110101110110001111111110011010000101100100101110011000111111001111111110011010000101100100101110011000111111011111010111011001000010 3fe68592e63f3fe68592e63f7d763fe68592e63f3fe68592e63f7d7642
EUC-JP ?諷呈??諷呈?}v?諷呈??諷呈?}vB 0011111111101011111001011100010011101000001111110011111111101011111001011100010011101000001111110111110101110110001111111110101111100101110001001110100000111111001111111110101111100101110001001110100000111111011111010111011001000010 3febe5c4e83f3febe5c4e83f7d763febe5c4e83f3febe5c4e83f7d7642
UTF-8 뤋諷呈쯓뤋諷呈쩸}v뤋諷呈쯓뤋諷呈쩸}vB 1110101110100100100010111110100010101011101101111110010110010001100010001110110010101111100100111110101110100100100010111110100010101011101101111110010110010001100010001110110010101001101110000111110101110110111010111010010010001011111010001010101110110111111001011001000110001000111011001010111110010011111010111010010010001011111010001010101110110111111001011001000110001000111011001010100110111000011111010111011001000010 eba48be8abb7e59188ecaf93eba48be8abb7e59188eca9b87d76eba48be8abb7e59188ecaf93eba48be8abb7e59188eca9b87d7642
UHC 뤋諷呈쯓뤋諷呈쩸}v뤋諷呈쯓뤋諷呈쩸}vB 10001111101110111111100110100100111011111101000010101001010011111000111110111011111110011010010011101111110100001010010101101110011111010111011010001111101110111111100110100100111011111101000010101001010011111000111110111011111110011010010011101111110100001010010101101110011111010111011001000010 8fbbf9a4efd0a94f8fbbf9a4efd0a56e7d768fbbf9a4efd0a94f8fbbf9a4efd0a56e7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)