To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 筌??誼??蹂μ?}v筌??誼??蹂μ?}vB 11100010101000110011111100111111100010110110001000111111001111111110011011111000100000111100101000111111011111010111011011100010101000110011111100111111100010110110001000111111001111111110011011111000100000111100101000111111011111010111011001000010 e2a33f3f8b623f3fe6f883ca3f7d76e2a33f3f8b623f3fe6f883ca3f7d7642
EUC-JP 筌??誼??蹂μ?}v筌??誼??蹂μ?}vB 11100100101001010011111100111111101101011100001100111111001111111110110011111010101001101100110000111111011111010111011011100100101001010011111100111111101101011100001100111111001111111110110011111010101001101100110000111111011111010111011001000010 e4a53f3fb5c33f3fecfaa6cc3f7d76e4a53f3fb5c33f3fecfaa6cc3f7d7642
UTF-8 筌뗫봾誼랃쭓蹂μ젢}v筌뗫봾誼랃쭓蹂μ젢}vB 111001111010110110001100111010111001011110101011111010111011010010111110111010001010101010111100111010111001111010000011111011001010110110010011111010001011100110000010110011101011110011101100101000001010001001111101011101101110011110101101100011001110101110010111101010111110101110110100101111101110100010101010101111001110101110011110100000111110110010101101100100111110100010111001100000101100111010111100111011001010000010100010011111010111011001000010 e7ad8ceb97abebb4bee8aabceb9e83ecad93e8b982cebceca0a27d76e7ad8ceb97abebb4bee8aabceb9e83ecad93e8b982cebceca0a27d7642
UHC 筌뗫봾誼랃쭓蹂μ젢}v筌뗫봾誼랃쭓蹂μ젢}vB 1110111110100111100010111110101110010100100001011110101111111110100011011110111110100111100010111110101110110011101001011110110010100000100110110111110101110110111011111010011110001011111010111001010010000101111010111111111010001101111011111010011110001011111010111011001110100101111011001010000010011011011111010111011001000010 efa78beb9485ebfe8defa78bebb3a5eca09b7d76efa78beb9485ebfe8defa78bebb3a5eca09b7d7642

SJIS-win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (0x3f) in the table.