To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 畏?〕揖??筍る?畏?〕揖??筍る?^ 1000100011011000001111111000000101101100100101110100101100111111001111111110001010100001100000101110100100111111100010001101100000111111100000010110110010010111010010110011111100111111111000101010000110000010111010010011111101011110 88d83f816c974b3f3fe2a182e93f88d83f816c974b3f3fe2a182e93f5e
EUC-JP 畏?〕揖??筍る?畏?〕揖??筍る?^ 1011000011011010001111111010000111001101110011011010110000111111001111111110010010100011101001001110101100111111101100001101101000111111101000011100110111001101101011000011111100111111111001001010001110100100111010110011111101011110 b0da3fa1cdcdac3f3fe4a3a4eb3fb0da3fa1cdcdac3f3fe4a3a4eb3f5e
UTF-8 畏븍〕揖뷂쭇筍る쇊畏븍〕揖뷂쭇筍る쐷^ 11100111100101011000111111101011101110001000110111100011100000001001010111100110100011111001011011101011101101111000001011101100101011011000011111100111101011011000110111100011100000101000101111101100100001111000101011100111100101011000111111101011101110001000110111100011100000001001010111100110100011111001011011101011101101111000001011101100101011011000011111100111101011011000110111100011100000101000101111101100100100001011011101011110 e7958febb88de38095e68f96ebb782ecad87e7ad8de3828bec878ae7958febb88de38095e68f96ebb782ecad87e7ad8de3828bec90b75e
UHC 畏븍〕揖뷂쭇筍る쇊畏븍〕揖뷂쭇筍る쐷^ 11101000111001101011101011101011101000011011001111101011111001111001010011101111101001111000001111100010111011001010101011101011100110011011110011101000111001101011101011101011101000011011001111101011111001111001010011101111101001111000001111100010111011001010101011101011100111001001100101011110 e8e6baeba1b3ebe794efa783e2ecaaeb99bce8e6baeba1b3ebe794efa783e2ecaaeb9c995e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)