To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????及鍮??恂??阿??踰??關唯? 0011111100111111001111110011111100111111001111111000101101111001111010000100101000111111001111111001110010010110001111110011111110001000101000100011111100111111111001101111101000111111001111111110100010010000100101110100001000111111 3f3f3f3f3f3f8b79e84a3f3f9c963f3f88a23f3fe6fa3f3fe89097423f
EUC-JP 濚??沅??及鍮??恂??阿??踰??關唯? 100011111100100110100001001111110011111110001111110001101110100100111111001111111011010111011010111011111010101100111111001111111101011111110110001111110011111110110000101001000011111100111111111011001111110000111111001111111110111111110000110011011010001100111111 8fc9a13f3f8fc6e93f3fb5daefab3f3fd7f63f3fb0a43f3fecfc3f3feff0cda33f
UTF-8 濚욌낌沅졾츦及鍮녕룯恂⑹㉨阿숇끃踰됮쨹關唯뉰 111001101011111110011010111011001001101010001100111010111000001010001100111001101011001010000101111011001010000110111110111011001011100010100110111001011000111110001010111010011000110110101110111010111000010110010101111010111010001110101111111001101000000110000010111000101001000110111001111000111000100110101000111010011001100010111111111011001000100010000111111010111000000110000011111010001011100010110000111010111001000010101110111011001010100010111001111010011001011110011100111001011001010010101111111010111000100110110000 e6bf9aec9a8ceb828ce6b285eca1beecb8a6e58f8ae98daeeb8595eba3afe68182e291b9e389a8e998bfec8887eb8183e8b8b0eb90aeeca8b9e9979ce594afeb89b0
UHC 濚욌낌沅졾츦及鍮녕룯恂⑹㉨阿숇끃踰됮쨹關唯뉰 1110011110111001100111101110101110110011101001101110101010110110101000001110010110101110100111001101000011100000111010111011100110110011111001111000111110100101111000101110000110101001111011001010100010111001111001001011100110011001111010111000010110111001111010111011001010001001111010011010010010010011110011101011110011101010111001101000100001000010 e7b99eebb3a6eab6a0e5ae9cd0e0ebb9b3e78fa5e2e1a9eca8b9e4b999eb85b9ebb289e9a493cebceae68842

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)