What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ???汝?????[???汝?????[^ 0011111100111111001111111001001111110000001111110011111100111111001111110011111101011011001111110011111100111111100100111111000000111111001111110011111100111111001111110101101101011110 3f3f3f93f03f3f3f3f3f5b3f3f3f93f03f3f3f3f3f5b5e
EUC-JP 薏??汝?????[薏??汝?????[^ 100011111101100111011110001111110011111111000110111100100011111100111111001111110011111100111111010110111000111111011001110111100011111100111111110001101111001000111111001111110011111100111111001111110101101101011110 8fd9de3f3fc6f23f3f3f3f3f5b8fd9de3f3fc6f23f3f3f3f3f5b5e
UTF-8 薏쎌㏈汝낆씫吏쇗즳[薏쎌㏈汝낆씫吏쇗즳[^ 111010001001011010001111111011001000111010001100111000111000111110001000111001101011000110011101111010111000001010000110111011001001010010101011111011111010011110011110111011001000011110010111111011001010011010110011010110111110100010010110100011111110110010001110100011001110001110001111100010001110011010110001100111011110101110000010100001101110110010010100101010111110111110100111100111101110110010000111100101111110110010100110101100110101101101011110 e8968fec8e8ce38f88e6b19deb8286ec94abefa79eec8797eca6b35be8968fec8e8ce38f88e6b19deb8286ec94abefa79eec8797eca6b35b5e
UHC 薏쎌㏈汝낆씫吏쇗즳[薏쎌㏈汝낆씫吏쇗즳[^ 111010111111101110111101111011001010011110111100111001101010001110000101111011001001110110111101111011001010011110111100111001101010001110000101010110111110101111111011101111011110110010100111101111001110011010100011100001011110110010011101101111011110110010100111101111001110011010100011100001010101101101011110 ebfbbdeca7bce6a385ec9dbdeca7bce6a3855bebfbbdeca7bce6a385ec9dbdeca7bce6a3855b5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
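
The conversion in the table can be approximated offline with a short Python sketch. It is not the code behind this page. The helper names `to_bit_strings` and `convert` are hypothetical, and the mapping from the charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption. Unencodable characters are replaced with `?` (0x3f), which appears to match the table, but results may differ from the page for some characters.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes '?' for characters the charset cannot represent.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {name: to_bit_strings(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (binary, hexa) in convert("汝[^").items():
        print(name, binary, hexa)
```

Each byte is rendered as exactly eight bits, so the binary column is always four times as long as the hexadecimal column.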