To which bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 罌??飮???с?v罌??飮???с?vB 111000111010000000111111001111111001111101011010001111110011111100111111100001001000001100111111011101101110001110100000001111110011111110011111010110100011111100111111001111111000010010000011001111110111011001000010 e3a03f3f9f5a3f3f3f84833f76e3a03f3f9f5a3f3f3f84833f7642
EUC-JP 罌??飮???с?v罌??飮???с?vB 111001101010001000111111001111111101110110111011001111110011111100111111101001111110001100111111011101101110011010100010001111110011111111011101101110110011111100111111001111111010011111100011001111110111011001000010 e6a23f3fddbb3f3f3fa7e33f76e6a23f3fddbb3f3f3fa7e33f7642
UTF-8 罌삠끏飮닷뎄戮с궠v罌삠끏飮닷뎄戮с궠vB 11100111101111011000110011101100100000101010000011101011100000011000111111101001101000111010111011101011100010111011011111101011100011101000010011101111101001111001001011010001100000011110101010110110101000000111011011100111101111011000110011101100100000101010000011101011100000011000111111101001101000111010111011101011100010111011011111101011100011101000010011101111101001111001001011010001100000011110101010110110101000000111011001000010 e7bd8cec82a0eb818fe9a3aeeb8bb7eb8e84efa792d181eab6a076e7bd8cec82a0eb818fe9a3aeeb8bb7eb8e84efa792d181eab6a07642
UHC 罌삠끏飮닷뎄戮с궠v罌삠끏飮닷뎄戮с궠vB 111001011010001010111011111000111000010110111111111010111110011010110100111001011011010110101100111010111011110110101100111000111000001010110011011101101110010110100010101110111110001110000101101111111110101111100110101101001110010110110101101011001110101110111101101011001110001110000010101100110111011001000010 e5a2bbe385bfebe6b4e5b5acebbdace382b376e5a2bbe385bfebe6b4e5b5acebbdace382b37642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
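
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the code behind this page. The helper names (`encode_bits`, `convert`) are hypothetical, and the mapping from the charset labels above to Python codec names (in particular SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" here, which would correspond to the 3f bytes seen in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC = Windows code page 949
}


def encode_bits(text, codec):
    """Encode text with the given codec and return (binary, hex) strings.

    Unencodable characters are replaced with "?" (byte 0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {label: encode_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table above.
    for label, (binary, hexadecimal) in convert("あB").items():
        print(label, binary, hexadecimal)
```

An ASCII character such as "B" should produce the same single byte (42) in every charset listed, while non-ASCII characters generally differ in both byte count and value from one charset to the next.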