To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ??????宥??[??????宥??[^ 0011111100111111001111110011111100111111001111111001011101000111001111110011111101011011001111110011111100111111001111110011111100111111100101110100011100111111001111110101101101011110 3f3f3f3f3f3f97473f3f5b3f3f3f3f3f3f97473f3f5b5e
EUC-JP 倻??沅??宥??[倻??沅??宥??[^ 10001111101100011111011000111111001111111000111111000110111010010011111100111111110011011010100000111111001111110101101110001111101100011111011000111111001111111000111111000110111010010011111100111111110011011010100000111111001111110101101101011110 8fb1f63f3f8fc6e93f3fcda83f3f5b8fb1f63f3f8fc6e93f3fcda83f3f5b5e
UTF-8 倻귣떩沅뺝쫩宥산텞[倻귣떩沅뺝쫩宥산텞[^ 111001011000000010111011111010101011011110100011111010111001011010101001111001101011001010000101111010111011101010011101111011001010101110101001111001011010111010100101111011001000001010110000111011011000010110011110010110111110010110000000101110111110101010110111101000111110101110010110101010011110011010110010100001011110101110111010100111011110110010101011101010011110010110101110101001011110110010000010101100001110110110000101100111100101101101011110 e580bbeab7a3eb96a9e6b285ebba9decaba9e5aea5ec82b0ed859e5be580bbeab7a3eb96a9e6b285ebba9decaba9e5aea5ec82b0ed859e5b5e
UHC 倻귣떩沅뺝쫩宥산텞[倻귣떩沅뺝쫩宥산텞[^ 111001011010011010000010111010111000101110111011111010101011011010010101111001011010011010000010111010101110100110111011111010101011011010010101010110111110010110100110100000101110101110001011101110111110101010110110100101011110010110100110100000101110101011101001101110111110101010110110100101010101101101011110 e5a682eb8bbbeab695e5a682eae9bbeab6955be5a682eb8bbbeab695e5a682eae9bbeab6955b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)