Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 脛件蹴筌削秀脛件蹴筌削秀B 11100011111110001000110010001111100011110101001011100010101000111000110111101101100011110100011111100011111110001000110010001111100011110101001011100010101000111000110111101101100011110100011101000010 e3f88c8f8f52e2a38ded8f47e3f88c8f8f52e2a38ded8f4742
EUC-JP 脛件蹴筌削秀脛件蹴筌削秀B 11100110111110101011011111101111101111011011001111100100101001011011101011101111101111011010100011100110111110101011011111101111101111011011001111100100101001011011101011101111101111011010100001000010 e6fab7efbdb3e4a5baefbda8e6fab7efbdb3e4a5baefbda842
UTF-8 脛件蹴筌削秀脛件蹴筌削秀B 11101000100001001001101111100100101110111011011011101000101110011011010011100111101011011000110011100101100010011000101011100111101001111000000011101000100001001001101111100100101110111011011011101000101110011011010011100111101011011000110011100101100010011000101011100111101001111000000001000010 e8849be4bbb6e8b9b4e7ad8ce5898ae7a780e8849be4bbb6e8b9b4e7ad8ce5898ae7a78042
UHC 脛件蹴筌削秀脛件蹴筌削秀B 11001100111010111100101111101100111101011110110111101111101001111101111011111011111000101011001111001100111010111100101111101100111101011110110111101111101001111101111011111011111000101011001101000010 ccebcbecf5edefa7defbe2b3ccebcbecf5edefa7defbe2b342

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
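
The same kind of conversion can be sketched in Python. This is only an illustration, not the code behind this page. It assumes that Python's built-in codecs `cp932`, `euc_jp`, and `cp949` correspond to SJIS-WIN, EUC-JP, and UHC, and that characters a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 row above. The names `CODECS` and `encode_row` are hypothetical.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    # Render every byte as 8 binary digits, most significant bit first.
    bits = "".join(format(byte, "08b") for byte in data)
    return bits, data.hex()


if __name__ == "__main__":
    sample = "件B"  # any short input string
    for charset, codec in CODECS.items():
        bits, hex_string = encode_row(sample, codec)
        print(charset, sample, bits, hex_string)
```

Each printed line follows the column order of the table: charset, character, bit string (binary), bit string (hexadecimal).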