To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌Q??筌??肄??飮??筌l????肯泣? 11100010101000111000001001110000001111110011111111100010101000110011111100111111111000111110010100111111001111111001111101011010001111110011111111100010101000111000001010001100001111110011111100111111001111111000110101101101100010111000001100111111 e2a382703f3fe2a33f3fe3e53f3f9f5a3f3fe2a3828c3f3f3f3f8d6d8b833f
EUC-JP 筌Q??筌??肄??飮??筌l?飡??肯泣? 111001001010010110100011110100010011111100111111111001001010010100111111001111111110011011100111001111110011111111011101101110110011111100111111111001001010010110100011111011000011111110001111111010001100100000111111001111111011100111001110101101011110001100111111 e4a5a3d13f3fe4a53f3fe6e73f3fddbb3f3fe4a5a3ec3f8fe8c83f3fb9ceb5e33f
UTF-8 筌Q뗭뒆筌듐룂肄덌㎗飮됱뒔筌l늿飡볩㎗肯泣랟 111001111010110110001100111011111011110010110001111010111001011110101101111010111001001010000110111001111010110110001100111010111001001110010000111010111010001110000010111010001000001010000100111010111000110110001100111000111000111010010111111010011010001110101110111010111001000010110001111010111001001010010100111001111010110110001100111011111011110110001100111010111000101010111111111010011010001110100001111010111011001110101001111000111000111010010111111010001000001010101111111001101011001110100011111010111001111010011111 e7ad8cefbcb1eb97adeb9286e7ad8ceb9390eba382e88284eb8d8ce38e97e9a3aeeb90b1eb9294e7ad8cefbd8ceb8abfe9a3a1ebb3a9e38e97e882afe6b3a3eb9e9f
UHC 筌Q뗭뒆筌듐룂肄덌㎗飮됱뒔筌l늿飡볩㎗肯泣랟 1110111110100111101000111101000110001011111011001000101010000100111011111010011110110101111000111000111110000011111011001011110110001000111011111010011110100011111010111110011010001001111011001000101010010001111011111010011110100011111011001000100010001000111000011110001010010011111011111010011110100011110100001110100111101011111010001000111001000001 efa7a3d18bec8a84efa7b5e38f83ecbd88efa7a3ebe689ec8a91efa7a3ec8888e1e293efa7a3d0e9ebe88e41

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)