What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN ?罕幀衛v?罕幀衛vB 0011111111100011101001011001101111101010100010010111000101110110001111111110001110100101100110111110101010001001011100010111011001000010 3fe3a59bea8971763fe3a59bea89717642
EUC-JP ?罕幀衛v?罕幀衛vB 0011111111100110101001111101011011101100101100011101001001110110001111111110011010100111110101101110110010110001110100100111011001000010 3fe6a7d6ecb1d2763fe6a7d6ecb1d27642
UTF-8 뤋罕幀衛v뤋罕幀衛vB 111010111010010010001011111001111011110110010101111001011011100110000000111010001010000110011011011101101110101110100100100010111110011110111101100101011110010110111001100000001110100010100001100110110111011001000010 eba48be7bd95e5b980e8a19b76eba48be7bd95e5b980e8a19b7642
UHC 뤋罕幀衛v뤋罕幀衛vB 10001111101110111111100111010110111011111101001111101010110110110111011010001111101110111111100111010110111011111101001111101010110110110111011001000010 8fbbf9d6efd3eadb768fbbf9d6efd3eadb7642

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
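
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's actual implementation. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the helper names encode_row and encode_table are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the ISO-8859-1 row above shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("vB").items():
        print(label, bits, hex_string)
```

Each byte is printed as eight binary digits, so the binary column is always four times as long as the hexadecimal column. Results for non-ASCII input may differ slightly from the table where this page's converter and Python's codecs disagree on a mapping.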