To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN ????る?娃??v????る?娃??vB 00111111001111110011111100111111100000101110100100111111100010001010000100111111001111110111011000111111001111110011111100111111100000101110100100111111100010001010000100111111001111110111011001000010 3f3f3f3f82e93f88a13f3f763f3f3f3f82e93f88a13f3f7642
EUC-JP ????る?娃??v????る?娃??vB 00111111001111110011111100111111101001001110101100111111101100001010001100111111001111110111011000111111001111110011111100111111101001001110101100111111101100001010001100111111001111110111011001000010 3f3f3f3fa4eb3fb0a33f3f763f3f3f3fa4eb3fb0a33f3f7642
UTF-8 黎곹쓷溜る젶娃좏뀫v黎곹쓷溜る젶娃좏뀫vB 111011111010011010001001111010101011001110111001111011001001001110110111111011111010011110001011111000111000001010001011111011001010000010110110111001011010100010000011111011001010001010001111111010111000000010101011011101101110111110100110100010011110101010110011101110011110110010010011101101111110111110100111100010111110001110000010100010111110110010100000101101101110010110101000100000111110110010100010100011111110101110000000101010110111011001000010 efa689eab3b9ec93b7efa78be3828beca0b6e5a883eca28feb80ab76efa689eab3b9ec93b7efa78be3828beca0b6e5a883eca28feb80ab7642
UHC 黎곹쓷溜る젶娃좏뀫v黎곹쓷溜る젶娃좏뀫vB 111001101011000110000001111011011001110110010100111010101111111010101010111010111010000010101010111010001101111110100000111011011000010110100001011101101110011010110001100000011110110110011101100101001110101011111110101010101110101110100000101010101110100011011111101000001110110110000101101000010111011001000010 e6b181ed9d94eafeaaeba0aae8dfa0ed85a176e6b181ed9d94eafeaaeba0aae8dfa0ed85a17642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)