What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??違??湲?????釉?????嗚??以 11100010101000110011111100111111100010001110000100111111001111111001111111010001001111110011111100111111001111110011111111100111110101100011111100111111001111110011111100111111100110100110101000111111001111111000100011001000 e2a33f3f88e13f3f9fd13f3f3f3f3fe7d63f3f3f3f3f9a6a3f3f88c8
EUC-JP 筌??違??湲?????釉?????嗚??以 11100100101001010011111100111111101100001110001100111111001111111101111011010011001111110011111100111111001111110011111111101110110110000011111100111111001111110011111100111111110100111100101100111111001111111011000011001010 e4a53f3fb0e33f3fded33f3f3f3f3feed83f3f3f3f3fd3cb3f3fb0ca
UTF-8 筌뗪막違댐쭓湲룸뼺料곗럩釉졿에類앸뼺嗚살럥以 111001111010110110001100111010111001011110101010111010111010011110001001111010011000000110010101111010111000110010010000111011001010110110010011111001101011100110110010111010111010001110111000111010111011110010111010111011111010011010111110111010101011001110010111111010111001111110101001111010011000011110001001111011001010000110111111111011001001011110010000111011111010011110010000111011001001010110111000111010111011110010111010111001011001011110011010111011001000001010110100111010111001111110100101111001001011101110100101 e7ad8ceb97aaeba789e98195eb8c90ecad93e6b9b2eba3b8ebbcbaefa6beeab397eb9fa9e98789eca1bfec9790efa790ec95b8ebbcbae5979aec82b4eb9fa5e4bba5
UHC 筌뗪막違댐쭓湲룸뼺料곗럩釉졿에類앸뼺嗚살럥以 1110111110100111100010111110101010111000101101111110101011011110101101001110111110100111100010111110101010111000101101111110101110010110101111011110100011110111101100001110110010001110100011001110101110111000101000001110011010111111101000011110101110111010100111011110101110010110101111011110011111110000101110111110110010001110100010001110110010100100 efa78beab8b7eadeb4efa78beab8b7eb96bde8f7b0ec8e8cebb8a0e6bfa1ebba9deb96bde7f0bbec8e88eca4

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
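
The conversion shown in the table can be approximated with a few lines of Python. This is only a sketch, not the code behind this page: the function name to_bit_strings and the CODECS mapping are hypothetical, and the choice of Python codec for each charset label (for example cp949 for UHC) is an assumption. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears consistent with the ISO-8859-1 row above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, charset):
    """Return (binary, hexadecimal) strings for text encoded in charset."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(CODECS[charset], errors="replace")
    # Render every byte as exactly eight binary digits.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    # Print one row per charset for a single sample character.
    for name in CODECS:
        binary, hexadecimal = to_bit_strings("以", name)
        print(name, binary, hexadecimal)
```

For the single character 以, this yields e4bba5 in UTF-8, 88c8 in SJIS-WIN, and b0ca in EUC-JP, matching the final bytes of the corresponding rows in the table.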