Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????v????vB 0011111100111111001111110011111101110110001111110011111100111111001111110111011001000010 3f3f3f3f763f3f3f3f7642
SJIS-WIN 鬩嶺クマv鬩嶺クマvB 1110100110101001100101111110010010111000100000110111110101110110111010011010100110010111111001001011100010000011011111010111011001000010 e9a997e4b8837d76e9a997e4b8837d7642
EUC-JP 鬩嶺クマv鬩嶺クマvB 11110010101010111100111011100110100011101011100010100101110111100111011011110010101010111100111011100110100011101011100010100101110111100111011001000010 f2abcee68eb8a5de76f2abcee68eb8a5de7642
UTF-8 鬩嶺クマv鬩嶺クマvB 111010011010110010101001111001011011011010111010111011111011110110111000111000111000001110011110011101101110100110101100101010011110010110110110101110101110111110111101101110001110001110000011100111100111011001000010 e9aca9e5b6baefbdb8e3839e76e9aca9e5b6baefbdb8e3839e7642
UHC ?嶺?マv?嶺?マvB 001111111101011010111010001111111010101111011110011101100011111111010110101110100011111110101011110111100111011001000010 3fd6ba3fabde763fd6ba3fabde7642

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
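
The table above can be approximated offline with a few lines of Python. This is only a minimal sketch, not the converter's actual implementation: the mapping from the table's charset names to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption, and `CODECS` and `encode_row` are hypothetical names introduced for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which matches what the ISO-8859-1 and UHC rows show.

```python
# Assumed mapping from the table's charset names to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: Python's cp932 stands in for SJIS-win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 stands in for UHC
}


def encode_row(text, codec):
    """Return (hex string, binary string) for text encoded with codec."""
    # errors="replace" turns each unencodable character into "?" (0x3f).
    raw = text.encode(codec, errors="replace")
    hex_str = raw.hex()
    bit_str = "".join(f"{byte:08b}" for byte in raw)
    return hex_str, bit_str


if __name__ == "__main__":
    sample = "マvB"  # any short string works here
    print("Charset", "Bit string (binary)", "Bit string (hexadecimal)")
    for name, codec in CODECS.items():
        hex_str, bit_str = encode_row(sample, codec)
        print(name, bit_str, hex_str)
```

Each byte is rendered as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.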