What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 檣?葬?杖?牆 1001111011111100001111111001000110010010001111111000111111110001001111111110000010101101 9efc3f91923f8ff13fe0ad
EUC-JP 檣?葬?杖?牆 1101110011111110001111111100000111110010001111111011111011110011001111111110000010101111 dcfe3fc1f23fbef33fe0af
UTF-8 檣렋葬렟杖렯牆 111001101010101010100011111010111010000010001011111010001001000110101100111010111010000010011111111001101001110110010110111010111010000010101111111001111000100110000110 e6aaa3eba08be891aceba09fe69d96eba0afe78986
UHC 檣렋葬렟杖렯牆 1110110111101010100011101010001011101101111101111000111010110000111011011110100010001110101111001110110111101101 edea8ea2edf78eb0ede88ebceded

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
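
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page: the function names (to_bit_strings, convert) and the mapping from the table's charset labels to Python codec names are assumptions made for illustration. In particular, "cp932" is assumed to stand in for SJIS-WIN and "cp949" for UHC, and the page's own converter may differ for some characters. Characters that a charset cannot represent are replaced with "?" (hex 3f), as in the rows above.

```python
# Assumed mapping from the labels used in the table to Python codec names.
# These are illustrative stand-ins, not necessarily what the page itself uses.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CODECS."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("檣").items():
        print(label, binary, hexadecimal)
```

For example, convert("檣")["UTF-8"] yields the hex string e6aaa3, which matches the first three bytes of the UTF-8 row in the table.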