What bit string is a character (or short string) encoded as in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN ?諷漿?幀臣??}v?諷漿?幀臣??}vB 0011111111100110100001011001111111110111001111111001101111101010100100000110001000111111001111110111110101110110001111111110011010000101100111111111011100111111100110111110101010010000011000100011111100111111011111010111011001000010 3fe6859ff73f9bea90623f3f7d763fe6859ff73f9bea90623f3f7d7642
EUC-JP ?諷漿?幀臣??}v?諷漿?幀臣??}vB 0011111111101011111001011101111011111001001111111101011011101100101111111100001100111111001111110111110101110110001111111110101111100101110111101111100100111111110101101110110010111111110000110011111100111111011111010111011001000010 3febe5def93fd6ecbfc33f3f7d763febe5def93fd6ecbfc33f3f7d7642
UTF-8 뤋諷漿㉩幀臣샅렗}v뤋諷漿㉩幀臣샅렗}vB 1110101110100100100010111110100010101011101101111110011010111100101111111110001110001001101010011110010110111001100000001110100010000111101000111110110010000011100001011110101110100000100101110111110101110110111010111010010010001011111010001010101110110111111001101011110010111111111000111000100110101001111001011011100110000000111010001000011110100011111011001000001110000101111010111010000010010111011111010111011001000010 eba48be8abb7e6bcbfe389a9e5b980e887a3ec8385eba0977d76eba48be8abb7e6bcbfe389a9e5b980e887a3ec8385eba0977d7642
UHC 뤋諷漿㉩幀臣샅렗}v뤋諷漿㉩幀臣샅렗}vB 10001111101110111111100110100100111011011110110010101000101110101110111111010011111000111110110110111011111101001000111010101100011111010111011010001111101110111111100110100100111011011110110010101000101110101110111111010011111000111110110110111011111101001000111010101100011111010111011001000010 8fbbf9a4edeca8baefd3e3edbbf48eac7d768fbbf9a4edeca8baefd3e3edbbf48eac7d7642

SJIS-Win, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
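
The rows in the table above can be approximated offline with a short Python sketch. The mapping from the table's charset labels to Python codec names (for example UHC to cp949, SJIS-WIN to cp932) is an assumption, as are the names CHARSETS and encode_row; this is not necessarily how the page itself computes its output. Characters that a charset cannot represent are replaced with "?" (byte 3f), which matches the 3f runs seen in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with "?" rather than raising.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "諷}vB"  # hypothetical sample input
    for label, codec in CHARSETS.items():
        bits, hexed = encode_row(sample, codec)
        print(label, sample, bits, hexed)
```

Because every byte is written as exactly eight binary digits and two hexadecimal digits, the two output columns always describe the same byte sequence.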