To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 円??獄??辱ユ?薔??円??節 1000100101111110001111110011111110001101100101100011111100111111100100000100101010000011100001100011111111100101010010110011111100111111100010010111111000111111001111111001000011011111 897e3f3f8d963f3f904a83863fe54b3f3f897e3f3f90df
EUC-JP 円??獄??辱ユ?薔??円??節 1011000111011111001111110011111110111001111101100011111100111111101111111010101110100101111001100011111111101001101011000011111100111111101100011101111100111111001111111100000011100001 b1df3f3fb9f63f3fbfaba5e63fe9ac3f3fb1df3f3fc0e1
UTF-8 円띨쑎獄띰쉑辱ユ쇂薔⑼쉐円띨썿節 111001011000011010000110111010111001110110101000111011001001000110001110111001111000110110000100111010111001110110110000111011001000100110010001111010001011111010110001111000111000001110100110111011001000011110000010111010001001011010010100111000101001000110111100111011001000100110010000111001011000011010000110111010111001110110101000111011001000110110111111111001111010111110000000 e58686eb9da8ec918ee78d84eb9db0ec8991e8beb1e383a6ec8782e89694e291bcec8990e58686eb9da8ec8dbfe7af80
UHC 円띨쑎獄띰쉑辱ユ쇂薔⑼쉐円띨썿節 1110010111110111101101101110111010011100101011011110100010101011101101101110111110111101101001111110100110110100101010111110011010011001101101101110110111111001101010011110111110111101101001101110010111110111101101101110111010011011101010011110111110111101 e5f7b6ee9cade8abb6efbda7e9b4abe699b6edf9a9efbda6e5f7b6ee9ba9efbd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)