To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????d}????????d{^ 001111110011111100111111001111110011111100111111001111110011111101100100011111010011111100111111001111110011111100111111001111110011111100111111011001000111101101011110 3f3f3f3f3f3f3f3f647d3f3f3f3f3f3f3f3f647b5e
SJIS-WIN 逵ク?讌逶豸閙譟d}逵ク?讌逶豸閙譟d{^ 111001111001110010111000001111111110011010100101111001111001101111100110101101101110100001111110111001101001111101100100011111011110011110011100101110000011111111100110101001011110011110011011111001101011011011101000011111101110011010011111011001000111101101011110 e79cb83fe6a5e79be6b6e87ee69f647de79cb83fe6a5e79be6b6e87ee69f647b5e
EUC-JP 逵ク訵讌逶豸閙譟d}逵ク訵讌逶豸閙譟d{^ 111011011111110010001110101110001000111111011101110100111110110010100111111011011111101111101100101110001110111111011111111011001010000101100100011111011110110111111100100011101011100010001111110111011101001111101100101001111110110111111011111011001011100011101111110111111110110010100001011001000111101101011110 edfc8eb88fddd3eca7edfbecb8efdfeca1647dedfc8eb88fddd3eca7edfbecb8efdfeca1647b5e
UTF-8 逵ク訵讌逶豸閙譟d}逵ク訵讌逶豸閙譟d{^ 1110100110000000101101011110111110111101101110001110100010101000101101011110100010101110100011001110100110000000101101101110100010110001101110001110100110010110100110011110100010101101100111110110010001111101111010011000000010110101111011111011110110111000111010001010100010110101111010001010111010001100111010011000000010110110111010001011000110111000111010011001011010011001111010001010110110011111011001000111101101011110 e980b5efbdb8e8a8b5e8ae8ce980b6e8b1b8e99699e8ad9f647de980b5efbdb8e8a8b5e8ae8ce980b6e8b1b8e99699e8ad9f647b5e
UHC 逵???????d}逵???????d{^ 1101000010110000001111110011111100111111001111110011111100111111001111110110010001111101110100001011000000111111001111110011111100111111001111110011111100111111011001000111101101011110 d0b03f3f3f3f3f3f3f647dd0b03f3f3f3f3f3f3f647b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)