To what bit string is a character (or short string) encoded in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN ?り?泣??油??v?り?泣??油??vB 001111111000001011101000001111111000101110000011001111110011111110010110111110110011111100111111011101100011111110000010111010000011111110001011100000110011111100111111100101101111101100111111001111110111011001000010 3f82e83f8b833f3f96fb3f3f763f82e83f8b833f3f96fb3f3f7642
EUC-JP ?り?泣??油??v?り?泣??油??vB 001111111010010011101010001111111011010111100011001111110011111111001100111111010011111100111111011101100011111110100100111010100011111110110101111000110011111100111111110011001111110100111111001111110111011001000010 3fa4ea3fb5e33f3fccfd3f3f763fa4ea3fb5e33f3fccfd3f3f7642
UTF-8 閭り내泣섇쮦油뱀궩v閭り내泣섇쮦油뱀궩vB 111011111010011010000110111000111000001010001010111010111000001010110100111001101011001110100011111011001000010010000111111011001010111010100110111001101011001010111001111010111011000110000000111010101011011010101001011101101110111110100110100001101110001110000010100010101110101110000010101101001110011010110011101000111110110010000100100001111110110010101110101001101110011010110010101110011110101110110001100000001110101010110110101010010111011001000010 efa686e3828aeb82b4e6b3a3ec8487ecaea6e6b2b9ebb180eab6a976efa686e3828aeb82b4e6b3a3ec8487ecaea6e6b2b9ebb180eab6a97642
UHC 閭り내泣섇쮦油뱀궩v閭り내泣섇쮦油뱀궩vB 111001101010110110101010111010101011001110111011111010111110100010011000111001011010100010000011111010101111101010111001111011001000001010111011011101101110011010101101101010101110101010110011101110111110101111101000100110001110010110101000100000111110101011111010101110011110110010000010101110110111011001000010 e6adaaeab3bbebe898e5a883eafab9ec82bb76e6adaaeab3bbebe898e5a883eafab9ec82bb7642

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
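
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the page's actual implementation, which is not shown here. The helper name `encode_table` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which appears to match the runs of `3f` in the table above.

```python
# Assumed mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset.

    Hypothetical helper: unencodable characters become '?' (0x3f).
    """
    rows = []
    for label, codec in CHARSETS.items():
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("り"):
        print(label, bits, hex_string)
```

Pure ASCII input such as `vB` yields the same bytes (`7642`) in all five charsets, which is why every row in the table ends with the same 16 bits.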