To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 狎??誼?┸柔レ? 1110000010111110001111110011111110001011011000100011111110000100101111011000111101011111100000111000110000111111 e0be3f3f8b623f84bd8f5f838c3f
EUC-JP 狎??誼?┸柔レ? 1110000011000000001111110011111110110101110000110011111110101000101111111011110111000000101001011110110000111111 e0c03f3fb5c33fa8bfbdc0a5ec3f
UTF-8 狎녴룗誼삼┸柔レ젴 111001111000101110001110111010111000010110110100111010111010001110010111111010001010101010111100111011001000001010111100111000101001010010111000111001101001111110010100111000111000001110101100111011001010000010110100 e78b8eeb85b4eba397e8aabcec82bce294b8e69f94e383aceca0b4
UHC 狎녴룗誼삼┸柔レ젴 111001001110010010000110111000111000111110010011111010111111111010111011111011111010011010111111111010101111010110101011111011001010000010101000 e4e486e38f93ebfebbefa6bfeaf5abeca0a8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)