To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN ????坎?潔?衿v????坎?潔?衿vB 001111110011111100111111001111111001101010101010001111111000110010001001001111111000101111011100011101100011111100111111001111110011111110011010101010100011111110001100100010010011111110001011110111000111011001000010 3f3f3f3f9aaa3f8c893f8bdc763f3f3f3f9aaa3f8c893f8bdc7642
EUC-JP ????坎?潔?衿v????坎?潔?衿vB 001111110011111100111111001111111101010010101100001111111011011111101001001111111011011011011110011101100011111100111111001111110011111111010100101011000011111110110111111010010011111110110110110111100111011001000010 3f3f3f3fd4ac3fb7e93fb6de763f3f3f3fd4ac3fb7e93fb6de7642
UTF-8 쒀롍쑈뤰坎찊潔쵌衿v쒀롍쑈뤰坎찊潔쵌衿vB 111011001001001010000000111010111010000110001101111011001001000110001000111010111010010010110000111001011001110110001110111011001011000010001010111001101011110110010100111011001011010110001100111010001010000110111111011101101110110010010010100000001110101110100001100011011110110010010001100010001110101110100100101100001110010110011101100011101110110010110000100010101110011010111101100101001110110010110101100011001110100010100001101111110111011001000010 ec9280eba18dec9188eba4b0e59d8eecb08ae6bd94ecb58ce8a1bf76ec9280eba18dec9188eba4b0e59d8eecb08ae6bd94ecb58ce8a1bf7642
UHC 쒀롍쑈뤰坎찊潔쵌衿v쒀롍쑈뤰坎찊潔쵌衿vB 101111101010110010001110110100111011111010100100100011111101111011001010111011001010100110001110110011001011111010101100100011101101000011011011011101101011111010101100100011101101001110111110101001001000111111011110110010101110110010101001100011101100110010111110101011001000111011010000110110110111011001000010 beac8ed3bea48fdecaeca98eccbeac8ed0db76beac8ed3bea48fdecaeca98eccbeac8ed0db7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)