To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 夜??爰??音??嚥♂?:沃???筌??愉 100101101110100100111111001111111110000010100111001111110011111110001001101110010011111100111111100110101000101110000001100010010011111110000001010001101001011110000000001111110011111100111111111000101010001100111111001111111001011011111001 96e93f3fe0a73f3f89b93f3f9a8b81893f814697803f3f3fe2a33f3f96f9
EUC-JP 夜??爰??音??嚥♂?:沃???筌??愉 110011001110101100111111001111111110000010101001001111110011111110110010101110110011111100111111110100111110101110100001111010010011111110100001101001111100110111100000001111110011111100111111111001001010010100111111001111111100110011111011 cceb3f3fe0a93f3fb2bb3f3fd3eba1e93fa1a7cde03f3f3fe4a53f3fccfb
UTF-8 夜껋눤爰덃룚音ㅻ뙑嚥♂삳:沃왥띕츇筌먦룂愉 111001011010010010011100111010101011101110001011111010111000100010100100111001111000100010110000111010111000110110000011111010111010001110011010111010011001111110110011111000111000010110111011111010111001100110010001111001011001101010100101111000101001100110000010111011001000001010110011111011111011110010011010111001101011001010000011111011001001100110100101111010111001110110010101111011001011100010000111111001111010110110001100111010111010100010100110111010111010001110000010111001101000010010001001 e5a49ceabb8beb88a4e788b0eb8d83eba39ae99fb3e385bbeb9991e59aa5e29982ec82b3efbc9ae6b283ec99a5eb9d95ecb887e7ad8ceba8a6eba382e68489
UHC 夜껋눤爰덃룚音ㅻ뙑嚥♂삳:沃왥띕츇筌먦룂愉 111001011010100010000011111011001000011110111011111010101011101010001000111001101000111110010110111010111110010110100100111010111000110010010110111001101011111110100001110011101011101111101011101000111011101011101000101010101001111011001110101101101110101110101110100001001110111110100111100100001110001110001111100000111110101011110000 e5a883ec87bbeaba88e68f96ebe5a4eb8c96e6bfa1cebbeba3bae8aa9eceb6ebae84efa790e38f83eaf0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)