To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 齊朴??諸?廓珥 11101010100011101001011001110000001111110011111110001111100101000011111110001010011001101110000011100000 ea8e96703f3f8f943f8a66e0e0
EUC-JP 齊朴??諸?廓珥 11110011111011101100101111010001001111110011111110111101111101000011111110110011110001111110000011100010 f3eecbd13f3fbdf43fb3c7e0e2
UTF-8 齊朴답홀諸계廓珥 111010011011110110001010111001101001110010110100111010111000101110110101111011011001100110000000111010001010101110111000111010101011001110000100111001011011101110010011111001111000111110100101 e9bd8ae69cb4eb8bb5ed9980e8abb8eab384e5bb93e78fa5
UHC 齊朴답홀諸계廓珥 11110000101110101101101011010011101101001110010011001000101001101111000010110011101100001110100011001110101010011110110010110100 f0badad3b4e4c8a6f0b3b0e8cea9ecb4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)