To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 關ャ鬧育オり京髫冗ァ玖成鬧育オり京髫冗ァ毅 11101000100100001010110011101001101001111000100011100111101101011000001011101000100010111001111011101001100110101000111111100111101001111000101111101000100100001010110011101001101001111000100011100111101101011000001011101000100010111001111011101001100110101000111111100111101001111000101101000010 e890ace9a788e7b582e88b9ee99a8fe7a78be890ace9a788e7b582e88b9ee99a8fe7a78b42
EUC-JP 關ャ鬧育オり京髫冗ァ玖成鬧育オり京髫冗ァ毅 111011111111000010001110101011001111001010101001101100001110100110001110101101011010010011101010101101011111111011110001111110101011111011101001100011101010011110110110111010101100000010101110111100101010100110110000111010011000111010110101101001001110101010110101111111101111000111111010101111101110100110001110101001111011010110100011 eff08eacf2a9b0e98eb5a4eab5fef1fabee98ea7b6eac0aef2a9b0e98eb5a4eab5fef1fabee98ea7b5a3
UTF-8 關ャ鬧育オり京髫冗ァ玖成鬧育オり京髫冗ァ毅 111010011001011110011100111011111011110110101100111010011010110010100111111010001000001010110010111011111011110110110101111000111000001010001010111001001011101010101100111010011010101110101011111001011000011010010111111011111011110110100111111001111000111010010110111001101000100010010000111010011010110010100111111010001000001010110010111011111011110110110101111000111000001010001010111001001011101010101100111010011010101110101011111001011000011010010111111011111011110110100111111001101010111110000101 e9979cefbdace9aca7e882b2efbdb5e3828ae4baace9ababe58697efbda7e78e96e68890e9aca7e882b2efbdb5e3828ae4baace9ababe58697efbda7e6af85
UHC 關?鬧育?り京?冗?玖成鬧育?り京?冗?毅 1100111010111100001111111101011110100010111010111100000000111111101010101110101011001100110010000011111111101001101101110011111111001111101110001110000011110111110101111010001011101011110000000011111110101010111010101100110011001000001111111110100110110111001111111110101111110110 cebc3fd7a2ebc03faaeaccc83fe9b73fcfb8e0f7d7a2ebc03faaeaccc83fe9b73febf6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)