To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 梧??饒??要??鼇??饒??要??箋∽? 1000110011100110001111110011111111101001011000000011111100111111100101110111011000111111001111111110101010000111001111110011111111101001011000000011111100111111100101110111011000111111001111111110001010110011100000011110010000111111 8ce63f3fe9603f3f97763f3fea873f3fe9603f3f97763f3fe2b381e43f
EUC-JP 梧??饒??要??鼇??饒??要??箋∽? 1011100011101000001111110011111111110001110000010011111100111111110011011101011100111111001111111111001111100111001111110011111111110001110000010011111100111111110011011101011100111111001111111110010010110101101000101110011000111111 b8e83f3ff1c13f3fcdd73f3ff3e73f3ff1c13f3fcdd73f3fe4b5a2e63f
UTF-8 梧놂숴饒뽳슴要랃쉠鼇믭슁饒뽳슴要랃쉠箋∽스 111001101010001010100111111010111000011010000010111011001000100010110100111010011010010110010010111010111011110110110011111011001000101010110100111010001010011010000001111010111001111010000011111011001000100110100000111010011011110010000111111010111010111110101101111011001000101010000001111010011010010110010010111010111011110110110011111011001000101010110100111010001010011010000001111010111001111010000011111011001000100110100000111001111010111010001011111000101000100010111101111011001000101010100100 e6a2a7eb8682ec88b4e9a592ebbdb3ec8ab4e8a681eb9e83ec89a0e9bc87ebafadec8a81e9a592ebbdb3ec8ab4e8a681eb9e83ec89a0e7ae8be288bdec8aa4
UHC 梧놂숴饒뽳슴要랃쉠鼇믭슁饒뽳슴要랃쉠箋∽스 111001111111110010110011111011111011110110100100111010011010111010010110111011111011110110111111111010011010100110001101111011111011110110101010111010001010100010010010111011111011110110110011111010011010111010010110111011111011110110111111111010011010100110001101111011111011110110101010111011111010100010100001111011111011110110111010 e7fcb3efbda4e9ae96efbdbfe9a98defbdaae8a892efbdb3e9ae96efbdbfe9a98defbdaaefa8a1efbdba

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)