To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嗚??乙??癲??韋?ぜ乙c?域??議?┿ 10011010011010100011111100111111100010011011001100111111001111111110000110011111001111110011111111101000111010000011111110000010101110101000100110110011100000101000001100111111100010001110011000111111001111111000101101100011001111111000010010111001 9a6a3f3f89b33f3fe19f3f3fe8e83f82ba89b382833f88e63f3f8b633f84b9
EUC-JP 嗚??乙??癲??韋?ぜ乙c?域??議?┿ 11010011110010110011111100111111101100101011010100111111001111111110001010100001001111110011111111110000111010100011111110100100101111001011001010110101101000111110001100111111101100001110100000111111001111111011010111000100001111111010100010111011 d3cb3f3fb2b53f3fe2a13f3ff0ea3fa4bcb2b5a3e33fb0e83f3fb5c43fa8bb
UTF-8 嗚삠겗乙댁죳癲븐룆韋뤺ぜ乙c꽳域㏃뼚議놅┿ 111001011001011110011010111011001000001010100000111010101011001010010111111001001011100110011001111010111000110010000001111011001010001110110011111001111001100110110010111010111011100010010000111010111010001110000110111010011001111110001011111010111010010010111010111000111000000110011100111001001011100110011001111011111011110110000011111010101011110110110011111001011001111110011111111000111000111110000011111010111011110010011010111010001010110110110000111010111000011010000101111000101001010010111111 e5979aec82a0eab297e4b999eb8c81eca3b3e799b2ebb890eba386e99f8beba4bae3819ce4b999efbd83eabdb3e59f9fe38f83ebbc9ae8adb0eb8685e294bf
UHC 嗚삠겗乙댁죳癲븐룆韋뤺ぜ乙c꽳域㏃뼚議놅┿ 111001111111000010111011111000111000000110101110111010111110000010110100111011001010000110001110111011111010011010111010111011001000111110000101111010101101111110001111111010001010101010111100111010111110000010100011111000111000010010111110111001101011010010100111111011001001011010100000111011001010000110000110111011111010011010111011 e7f0bbe381aeebe0b4eca18eefa6baec8f85eadf8fe8aabcebe0a3e384bee6b4a7ec96a0eca186efa6bb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)