To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 礁ウ芝爵?豎芝 100011111100101010110011100011101100010110001110110111010011111111100110101100011000111011000101 8fcab38ec58edd3fe6b18ec5
EUC-JP 礁ウ芝爵栿豎芝 101111101100110010001110101100111011110011000111101111001101111110001111110000111101011111101100101100111011110011000111 becc8eb3bcc7bcdf8fc3d7ecb3bcc7
UTF-8 礁ウ芝爵栿豎芝 111001111010010010000001111011111011110110110011111010001000101010011101111001111000100010110101111001101010000010111111111010001011000110001110111010001000101010011101 e7a481efbdb3e88a9de788b5e6a0bfe8b18ee88a9d
UHC 礁?芝爵??芝 1111010110100111001111111111001010111001111011011100100100111111001111111111001010111001 f5a73ff2b9edc93f3ff2b9

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
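
The conversion shown in the table can be approximated with a minimal Python sketch. The function name `encode_to_bits` and the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumptions made for illustration; this page's own converter may use a different implementation, so results for edge cases may differ. Characters that a charset cannot represent are replaced with "?" (0x3f), as in the rows above.

```python
def encode_to_bits(text, codec):
    """Return (binary string, hex string) for `text` encoded with `codec`."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    # Render each byte as 8 binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_to_bits("芝", codec)
        print(label, bits, hex_string)
```

For example, `encode_to_bits("芝", "utf-8")` yields the three bytes e8 8a 9d, which match the UTF-8 row above wherever 芝 appears.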