To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 汚??恂ラ?節ヨ?厓ο?鹽??歪??渦?? 10001001100110000011111100111111100111001001011010000011100010010011111110010000110111111000001110001000001111111111101010001101100000111100110100111111111010100110010000111111001111111001100001100011001111110011111110001001010100010011111100111111 89983f3f9c9683893f90df83883ffa8d83cd3fea643f3f98633f3f89513f3f
EUC-JP 汚??恂ラ?節ヨ?厓ο?鹽??歪??渦?? 1011000111111000001111110011111111010111111101101010010111101001001111111100000011100001101001011110100000111111100011111011010011000111101001101100111100111111111100111100010100111111001111111100111111000100001111110011111110110001101100100011111100111111 b1f83f3fd7f6a5e93fc0e1a5e83f8fb4c7a6cf3ff3c53f3fcfc43f3fb1b23f3f
UTF-8 汚루ㅌ恂ラ궢節ヨ풙厓ο슝鹽쇈꺂歪듸쉥渦쒐뼇 1110011010110001100110101110101110100011101010001110001110000101100011001110011010000001100000101110001110000011101010011110101010110110101000101110011110101111100000001110001110000011101010001110110110010010100110011110010110001110100100111100111010111111111011001000101010011101111010011011100110111101111011001000011110001000111010101011101010000010111001101010110110101010111010111001001110111000111011001000100110100101111001101011100010100110111011001001001010010000111010111011110010000111 e6b19aeba3a8e3858ce68182e383a9eab6a2e7af80e383a8ed9299e58e93cebfec8a9de9b9bdec8788eaba82e6adaaeb93b8ec89a5e6b8a6ec9290ebbc87
UHC 汚루ㅌ恂ラ궢節ヨ풙厓ο슝鹽쇈꺂歪듸쉥渦쒐뼇 111001111111110110110111111001111010010010111100111000101110000110101011111010011000001010110101111011111011110110101011111010001011111010011100111001001110110110100101111011111011110110111001111001111010010010111100111000111000001110101011111010001110000010110101111011111011110110101011111010001011111010011100111001111001011010010001 e7fdb7e7a4bce2e1abe982b5efbdabe8be9ce4eda5efbdb9e7a4bce383abe8e0b5efbdabe8be9ce79691

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)