To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ?m??れ????^ 001111111000001010001101001111110011111110000010111010100011111100111111001111110011111101011110 3f828d3f3f82ea3f3f3f3f5e
EUC-JP ?m??れŁ???^ 0011111110100011111011010011111100111111101001001110110010001111101010011010100000111111001111110011111101011110 3fa3ed3f3fa4ec8fa9a83f3f3f5e
UTF-8 淋m쉾淋れŁ淋믪㏏^ 111011111010011110110101111011111011110110001101111011001000100110111110111011111010011110110101111000111000001010001100110001011000000111101111101001111011010111101011101011111010101011100011100011111000111101011110 efa7b5efbd8dec89beefa7b5e3828cc581efa7b5ebafaae38f8f5e
UHC 淋m쉾淋れŁ淋믪㏏^ 11101100111110001010001111101101100110101001001011101100111110001010101011101100101010001010100111101100111110001001001011101100101001111011100101011110 ecf8a3ed9a92ecf8aaeca8a9ecf892eca7b95e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)