To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌Q??筌??泣??恂ъ?筌Q??筌??泣 11100010101000111000001001110000001111110011111111100010101000110011111100111111100010111000001100111111001111111001110010010110100001001000110000111111111000101010001110000010011100000011111100111111111000101010001100111111001111111000101110000011 e2a382703f3fe2a33f3f8b833f3f9c96848c3fe2a382703f3fe2a33f3f8b83
EUC-JP 筌Q??筌??泣??恂ъ?筌Q??筌??泣 11100100101001011010001111010001001111110011111111100100101001010011111100111111101101011110001100111111001111111101011111110110101001111110110000111111111001001010010110100011110100010011111100111111111001001010010100111111001111111011010111100011 e4a5a3d13f3fe4a53f3fb5e33f3fd7f6a7ec3fe4a5a3d13f3fe4a53f3fb5e3
UTF-8 筌Q뗭뒠筌듸퐤泣뚳㎗恂ъ퐷筌Q뗭뒠筌듸퐤泣 1110011110101101100011001110111110111100101100011110101110010111101011011110101110010010101000001110011110101101100011001110101110010011101110001110110110010000101001001110011010110011101000111110101110011010101100111110001110001110100101111110011010000001100000101101000110001010111011011001000010110111111001111010110110001100111011111011110010110001111010111001011110101101111010111001001010100000111001111010110110001100111010111001001110111000111011011001000010100100111001101011001110100011 e7ad8cefbcb1eb97adeb92a0e7ad8ceb93b8ed90a4e6b3a3eb9ab3e38e97e68182d18aed90b7e7ad8cefbcb1eb97adeb92a0e7ad8ceb93b8ed90a4e6b3a3
UHC 筌Q뗭뒠筌듸퐤泣뚳㎗恂ъ퐷筌Q뗭뒠筌듸퐤泣 111011111010011110100011110100011000101111101100100010101001110011101111101001111011010111101111101111011000110111101011111010001000110011101111101001111010001111100010111000011010110011101100101111011010000011101111101001111010001111010001100010111110110010001010100111001110111110100111101101011110111110111101100011011110101111101000 efa7a3d18bec8a9cefa7b5efbd8debe88cefa7a3e2e1acecbda0efa7a3d18bec8a9cefa7b5efbd8debe8

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
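
The conversion in the table above can be approximated offline. The following is a minimal Python sketch, not the code behind this page. It assumes that Python's `cp932` codec stands in for SJIS-Win and `cp949` for UHC, and that characters a charset cannot represent are replaced with "?" (0x3f), which is what the ISO-8859-1 row suggests. The names `CHARSETS`, `encode_row`, and `convert` are hypothetical.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 as the SJIS-Win equivalent
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 as the UHC equivalent
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexed) in convert("あ").items():
        print(label, bits, hexed)
```

When the input is pasted in the wrong source encoding, every row comes out as replacement bytes or unrelated characters, so it is worth confirming that the input really is UTF-8 before comparing rows.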