What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ???汝?????[???汝?????[^ 0011111100111111001111111001001111110000001111110011111100111111001111110011111101011011001111110011111100111111100100111111000000111111001111110011111100111111001111110101101101011110 3f3f3f93f03f3f3f3f3f5b3f3f3f93f03f3f3f3f3f5b5e
EUC-JP 沅??汝?????[沅??汝?????[^ 100011111100011011101001001111110011111111000110111100100011111100111111001111110011111100111111010110111000111111000110111010010011111100111111110001101111001000111111001111110011111100111111001111110101101101011110 8fc6e93f3fc6f23f3f3f3f3f5b8fc6e93f3fc6f23f3f3f3f3f5b5e
UTF-8 沅룹㏈汝낃떱吏쇗즳[沅룹㏈汝낃떱吏쇗즳[^ 111001101011001010000101111010111010001110111001111000111000111110001000111001101011000110011101111010111000001010000011111010111001011010110001111011111010011110011110111011001000011110010111111011001010011010110011010110111110011010110010100001011110101110100011101110011110001110001111100010001110011010110001100111011110101110000010100000111110101110010110101100011110111110100111100111101110110010000111100101111110110010100110101100110101101101011110 e6b285eba3b9e38f88e6b19deb8283eb96b1efa79eec8797eca6b35be6b285eba3b9e38f88e6b19deb8283eb96b1efa79eec8797eca6b35b5e
UHC 沅룹㏈汝낃떱吏쇗즳[沅룹㏈汝낃떱吏쇗즳[^ 111010101011011010110111111011001010011110111100111001101010001110000101111010101011011010110111111011001010011110111100111001101010001110000101010110111110101010110110101101111110110010100111101111001110011010100011100001011110101010110110101101111110110010100111101111001110011010100011100001010101101101011110 eab6b7eca7bce6a385eab6b7eca7bce6a3855beab6b7eca7bce6a385eab6b7eca7bce6a3855b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
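
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the helper name `to_bit_strings` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which resembles the "?" entries in the table, although the page's own replacement rules may differ.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings of text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    sample = "[^"  # plain ASCII, so every charset yields the same bytes
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

Calling `to_bit_strings` with a non-ASCII character shows where the charsets diverge: multi-byte encodings produce different byte sequences, while a single-byte charset such as ISO-8859-1 falls back to "?".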