To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 偲七偲フナ爾偲七 1000111011000011100011101011010110001110110000111100110011000101100011101010001010001110110000111000111010110101 8ec38eb58ec3ccc58ea28ec38eb5
EUC-JP 偲七偲フナ爾偲七 10111100110001011011110010110111101111001100010110001110110011001000111011000101101111001010010010111100110001011011110010110111 bcc5bcb7bcc58ecc8ec5bca4bcc5bcb7
UTF-8 偲七偲フナ爾偲七 111001011000000110110010111001001011100010000011111001011000000110110010111011111011111010001100111011111011111010000101111001111000100010111110111001011000000110110010111001001011100010000011 e581b2e4b883e581b2efbe8cefbe85e788bee581b2e4b883
UHC ?七???爾?七 0011111111110110110100100011111100111111001111111110110010110011001111111111011011010010 3ff6d23f3f3fecb33ff6d2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)